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The U.S. Government has certain rights in this invention pursuant to grant numbers 
AI-21 707 and HD-29687 from National Institutes of Health. 



The invention relates to novel Haemophilus adhesion proteins, nucleic acids, and 
antibodies. 



Most bacterial diseases begin with colonization of a particular mucosal surface 
(Beachey et al., 1981, J. Infect. Dis. 143:325-345). Successful colonization requires 
that an organism overcome mechanical cleansing of the mucosal surface and evade 
the local immune response. The process of colonization is dependent upon 
specialized microbial factors that promote binding to host cells (Hultgren et ai, 
1 993 Cell, 73:887-901 ). In some cases the colonizing organism will subsequently 
enter (invade) these cells and survive intracellularly(Falkow, 1991, Cell 65:1099- 
1102). 

Haemophilus influenzae is a common commensal organism of the human respiratory 
tract (Kuklinska and Kilian, 1 984, Eur. J. Clin. Microbiol. 3:249-252). It is the most 
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„,„,o„ cause of baceria, .neningius and a leading cause of o*er invasive 
aTi:: di3«^. .n addition, *is..ani.n, is .sponsiHe fo, a sizeabie 
:2:ofaLandc,„onico,i.sn.ed.a.— ..oncMas.andpneu.on.a. 

„M,..<n/.„.n.^isahun,an-specif,corga„isn,,ha.no™^^^^ 

Lannasopha^nxand .us.co,oni.=*is3ite in o„ieno avo.d e«„c„on. Th, 
r:Jn^..o.>.ace.^u^scapa.,eofp—a — 

; lORO J Infect Dis. 146:564; Pichichero et aU 198- Lancei 
cells(Guennae/a/..l982.J.Intect.u ^ ^ . . Qn.2875.^879) 

I loo-x Proc Natl Acad. Sci. U.S.A. 90.28 /^--»'''^ 
M •960-96^- St. Geme et al, 1993. Proc. isau. 

:IiUon.H..-^.-^.Hasac<,ui«d..capaci..oe„.a.su^^^^^^^^^ 

ce.MForsg«n«./..19^.1nfec...n™-*^^^".679..St.GemeandFa.ko..^^ 

rIIJun.5S:40,^044;S.Oen,eandFalkow.,99,..nfec..In»u.59:,«5- 

"* «,,S6 337U Asaiesuh.*isbacieriumisaniraponanl 

1333.1nfect.lnmiun.59;3366-3o7l). Asa l, 1984 J Med 

..useofbo*.ocaUzedrespira.o,y»ac.a.dsys,e.icd,sease(Tu,k,.984,J.Me4 

:Lho.. U:...a,. Nonencapsuia^-non-typabies^nsaccoun...™ 
„noca.disease(Tu..i9S4.su,a,.ncon.^use.o.pebs.a.ns^.^^^-^^^^ 

. capsuie composed ofa po,yn.er of Hbose and '^^ -j; 

..,„..efo,ove,9S.ofcasesofK..>...».^^^^^^^^ 

Clinical in,por«nce o{ Haemophilus inftue^^e, p. .-9. S.H. 

disease, Bsevier/Nonh-Holland Publishing Co., New York). 

• «f HiQease due to H. influenzae involves 
The initial step in .he pathogenests of disease due J 
colonizationof the upper respiratory mucosa (Murphy « oL. 1987, L Infect^D 
^7.3-73.) Coloni.tlonwi.hapar.icula,s,rainn,aypersistforweelcs.ornon.h. 
andlstildividualsrenrainasyntptontaticthroughoutthisperiodCSpmola.... 

^86 intct Dis. 154.00-109,. However..ncer.ainc,rcun,stancescoloniza„<. 
!^^f""edbycontiguousspreadwi,hin.herespira.or>.tract.resuUi„.inloca, 

r ;lhe.nidLear..hes.nuses..heco.unctiva.or.he,ungs. ~ly. 
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on occasion bacteria will penetrate the nasopharyngeal epithelial barrier and enter 
the bloodstream. 

In vitro observations and animal studies suggest that bacterial surface appendages 
called pili (or fimbriae) play an important role in H. influenzae colonization. In 

5 1 982 two groups reported a correlation between piliation and increased attachment 

to human oropharyngeal epithelial cells and erythrocytes (Guerina et ai. supra; 
Pichichero et ai, supra). Other investigators have demonstrated that anti-pilus 
antibodies block in vitro attachment by piliated H. influenzae (Forney et al. . 1 992, 
J. Infect. Dis. 165:464-470;van Alphene/a/., 1988. Infect. Immun. 56:1 800- 1806> 

1 0 Recently Weber et al insertionally inactivated the pilus structural gene in an H. 

influenzae type b strain and thereby eliminated expression of pili; the resulting 
mutant exhibited a reduced capacity for colonization of year-old monkeys (Weber 
etal, 1991. Infect. Immun. 59:4724-4728). 

A number of reports suggest that nonpilus factors also facilitate Haemophilus 
1 5 colonization. Using the human nasopharyngeal organ culture model. Farley et al. 

(1986.J. Infect. Dis. 161:274-280) and Loebe/a/. (1988,lnfect. Immun. 49:484- 
489) noted that nonpiliated type b strains were capable of mucosal attachment. Read 
and coworkers made similar observations upon examining nontypable strains in 
a model that employs nasal turbinate tissue in organ culture (1991. J. Infect. Dis. 
20 163:549-558). In the monkey colonization study by Weber et al. (1991. supra). 

nonpiliated organisms retained a capacity for colonization, though at reduced 
densities: moreover, among monkeys originally infected with the piliated stram. 
virtually all organisms recovered from the nasopharynx were nonpiliated. All of 
these observationsare consistent with the fmding that nasopharyngeal isolates from 
25 children colonized with H. influenzae are frequently nonpiliated (Mason et al. , 1 985. 

Infect. Immun. 49:98-103; Brintone/ al.. 1 989. Pediatr. Infect. Dis. J. 8:554-561 X 



BNSDOCIO:<WO 363051 9A1 I > 



PCTAJS96/04031 
WO 96/30519 



Previous smdieshav«hovmtf«.HM«^«'« are capableofen,ering(invadmg) 

cultured human epithelial cells via a pili-independen, tnechanism (St. Geme and 
Falkow.l990.supra; St. GemeandFalko«,1991.supra). Although//. m/I".".-- 

is no, generally consider«I an intracellular parasite, a recent report suggests that 
5 .hesemv»ronndingsmayhavean,>.v/voco,Tela,e(Forsgrene,./..1994.supra). 
Forsg^nandcoworkcrsexanunedadenoidsfrotnlOchildrenwhohadtheiradenotds 
removed becauseoflongsta^iingsecretoryotitismedia or adenoidalhypertrophy. 
mall lOcasesthete were viable intracellular//, infl^ae. Electron microscopy 
demonstrated that these organisms were concentrated in Ute reticular crypt 
.10 epitheliumandinmacrophage-likecellsinthesubepitheliallayeroftissue. One 
possibilitv is that bacterial entry into host cells provides a mechanism for evas.on 
of the local immune response, thereby allowingpersistence in the respiratory tract 

Thus a vaccine for the therapeutic and prophylactic treatment of //uemopW/u. 
infection is desirable. Accordingly, it is an object of the present invention to pmvtde 
, 5 for recombinant Ha.mop«te Adherence (HA) proteins and variants thereof, and 

to produce useful quantities of these HA proteins using rccombmant DNA 

techniques. 

It is a furtherobjectof the inventionto providerecombinantnucleic acids encoding 
HA proteins, and expression vectors and host cells conuining the nucletc acd 
20 encoding the HA protein. 

An additional object of the invention is to p,»vide monoclonal antibodies for the 
diagnosis of Haemophilus infection. 

A further object of the invention is to provide methods for producing the HA 
proteins, and a vaccine comprising the HA proteins of the present invent.on. 
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Methods for the therapeutic and prophylactic treatment of Haemophilus infection 
are also provided. 

SUMMARY OF THE INVENTION 

In accordance with the foregoing objects, the present invention provides recombinant 
5 HA proteins, and isolated or recombinant nucleic acids which encode the HA 

proteins of the present invention. Also provided are expression vectors which 
comprise DNA encoding a HA protein operably linked to transcriptional and 
translational regulatory DNA, and host ceils which contain the expression vectors. 

The invention provides also provides methods for producing HA proteins which 
1 0 comprises culturing a host cell transformed with an expression vector and causing 

expression of the nucleic acid encoding the HA protein to produce a recombinant 
HA protein. 

The invention also includes vaccines for Haemophilus influenzae infection 
comprising an HA protein for prophylactic or therapeutic use in generating an 
1 5 immune response in a patient. Methods of treating or preventing Haemophilus 

influenzae infection comprise administering a vaccine. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures lA, IB. and IC depict the nucleic acid sequence of HAl. 

Figure 2 depicts the amino acid sequence of HAl . 

20 Figures 3 A, 3B, 3C, 3D, 3E, 3F and 3G depict the nucleic acid sequence and amino 

acid sequence of HA2. 
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■l,^,y are indicated by shaded. stHped. and open ba.. cor.spon .ng o 
::™ildon.ins.in.e™Mon,a.s.a„dC— Mo.ains..«.^^^^^^ 
.„dci..es.ep.sen.censc„edWai.e.bo.ATP.«nd.nsn,o„(^^^^^^ 
, T^umbcrs above bars refer .0 amino acid residue posmons ,n *e full lenph 
pll Numbers in pa«n.eses beiow U,e HA. bars 
in-iiariry/percen, idemi. between .bese domains .d *e ^^^'^^^^ 
aomains. Tbe regions of HA2 defined by amino ac.d ^^^^^^^ 
846. and 1292 ,o 1475 show minimal similarity ,o ammoacds 51 to 220 otHAl. 

,0 Figure 5 depictsthe homology betwe^r the N-termina. amino acid sequences of 

„rianaHA2.Singlele..erabbrevia.ionsareusedfor.eam,noac.ds^A 

>„aica,esidentitybe.«een,heresidues.andt„odo.sindicateconservat,vechange. 
i.e. similarity between residues. 

Pigure6dcpictstherestric,ionmapsofphagell.nandp,asmidpT7.7subc,„ne. 

,5 P.,„e7 depictstherestrictionmapof pDC400andderivatives. pDC400con.ai. 

T..inse«..ms,rainC54c,onedintopUC,9. Vector sequences are repres^.^ 
•hatchedbo.es. Lettersabovethetophori^ontallmeindic—^^ 

horirontallinewitharrowrepresentsthelocat,onoftheM/locusw,th p 

,0 Z L direction of transcHption. The striated horizontal line «P-- ^ " 
rbinlgenicftagmentusedasaprobeforSouthemanalysis. m plasmtd pDC602. 
kb intragerai. uog r»r/;ni hut in the opposite 

which is not shown, contains the same tnser, as pDC601. 



orientation. 



25 



p-.ure 8 shows the identification of plasmid-encoded proteins using the 
Figure 8 shows tn radiolabeled with 

bacteriophage T7 expression system. Bacteria w 
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trans- [^^S]-labeL and whole cell lysates were resolved on a 7.5% 
SDS-polyacrylamidegel. Proteins were visualized by autoradiography. Lane K 
E. coli BL21(DE3)/pT7.7uninduced;lane 2, BL21(DE3)/pT7"7 induced: lane 3, 
BL21(DE3)/pDC602 uninduced; lane 4. BL21(DE3)/pDC602 induced; lane 5, 
5 BL21(DE3)/pDC601 uninduced: lane 6, BL21(DE3)/pDC601 induced. The 

plasmids pDC602 and pDC601 are derivatives of pT7-7 that contain the 8.3 ]^Xba\ 
fragment from pDC400 in opposite orientations. The asterisk indicates the 
overexpressed protein in BL21(DE3)/pDC601 . 

Figure 9 depicts the southern analysis of chromosomal DNA from H. influenzae 
10 strainsC54and 1 1, probing with //^ 2 versus DNA fragments were separated 

on a 0.7% agarose gel and transferred bidirectionally to nitrocellulose membranes 
prior to probing with either HA I or HA2, Lane 1 , C54 chromosomal DNA digested 
with BgHh lane 2, C54 chromosomal DNA digested with C/al; lane 3. C54 
chromosomal DNA digested with Psth lane 4, 1 1 chromosomal DNA digested with 
1 5 Bglll: lane 5,11 chromosomal DNA digested with Cla\\ lane 6, 1 1 chromosomal 

DNA digested with AT^^iI. A. Hybridization with the 3.3 kb Pstl-BgRl intragenic 
fragment of HA2 from strain C54. B. Hybridization with the 1.6 kb Sryl-Sspl 
intragenic fragment of HA I from strain 1 1 . 

Figure 10 depicts the comparison of cellular binding specificities of £. coli DH5a 
20 harboring HA2 versus HA J. Adherence was measured after incubating bacteria 

with eucaryotic cell monolayers for 30 minutes as described and was calculated 
by dividing the number of adherent colony forming units by the number of 
inoculated colony forming units (St. Geme et al., 1993). Values are the mean ± 
SEM of measurements made in triplicate from representative experiments. The 
25 plasmid pDC601 contains the HA2 gene from H influenzae strain C54. while 

pHM W8-5 contains the HA 1 gene from nontypable H, influenzae strain 1 1 . Both 
pDC601 and pHMW8-5 were prepared using pT7-7 as the cloning vector. 
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HMW. MDA...Tsh.anaSep.. TheN— >s=„.e„ceofHA2,sa.,gneav«* 
those of HAI (Barenkamp.SJ . and J.w.»i.<j 

(Barenkamp,S.J..andE.Leminger.iv^ v, „h «,nlecular 

„.,.surfac..exposeap.«ins«.a.=d.fUan,e„.o.— ^^^^ 

Aft. iim-l 3 1 3 ) AlDA-1 (Benz, 1., and M.A. SctimiQi. 
nprtiissis Infect. Immun. 60:1 302-1 J I J ). 

A.OA... *= a*.„ invowea in diffuse aaherence of *e dia„ho=a.e„ c 
L«..»«-in27S7(On6:H27Mssyna.esi.dviaaprecu.^^^^^^^^^ 

^.tra^c— fa.e„=invoWedin— u.,>aUon.,anavi.^^^^^^^ 

tL*. coU suain. .nfec. .mmun- «:n69.n80., and Sep A 
EscHer,cl.,a sansoneoi. and C. Parso.. 1995. SepA. the major 

exiraceuu. p ,7.p3-l35). A consensus sequence is shovvn 

in tissue invasion. Mol. Microbiol. 17.12^ 

on the lower line. 
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P e P depicu *e sou*«n analysis of chromosoma. DNA from 
Figure 12 depicts „f „ ,„^„,„<,etypeb.ChromosomalDNAwas 

epidemiologicallydistmctslrainsofK m/Iuen-oetyp . 

:ges.edwi.hB.m.separa.edona0.r/.asarose,e>.— — 
3„aprobedwia>.he3.3.b.«..B.min.age.cftagme.cf;./fr~CS^^ 
, ■ rS41ane^ 5mun)081;lane3.slrainl065;lane4.straml0.8.lane 

r::i:ro:::=a::i^ 

strain 1070-. lane 10. suain 1076; lane 1 1. strain 1084. 

. Pigure 13 depicts the — analysis of Cromosoma. DNA from no,,.yP=^b 
Iapsu,atedstrains„fH...e...C.omoson.a.DNAwasd,gested..^ 
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separated on a 0.7% agarose gel. transferred to nitrocellulose, and probed with the 
3.3 kb Pstl'BgUl intragenic fragment of hsf from strain C54. Lane 1, SM4 (type 
a); lane 2, SM72 (type c); lane 3, SM6 (type d); lane 4. Rd (type d); lane 5. SM7 
(typee); lane 6, 142 (typee): lane 7, 327 (typee): lane 8,351 (typee); lane9. 134 
5 (type f); lane 10, 219 (type 0: lane 1 1, 346 (type f); lane 12. 503 (type f). 

Figures 14A and 14B are the nucleic acid sequence of HA3. 

Figure 15 is the amino acid sequence of HA3. 

Figures 16A and 16B depict the homology between the amino acid sequences of 
HAl and HA3, Single letter abbreviations are used for the amino acids. A line 
1 0 indicates identity between the residues, and two dots indicate conservativechanges. 

i.e. similarity between residues. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides novel Haemophilus Adhesion (HA) proteins. In 
a preferred embodiment, the HA proteins are from Haemophilus sitmns^ and in the 
1 5 preferred embodiment, from Haemophilus influenza. In particular, H. influenzae 

encapsulated type b strains are used to clone the HA proteins of the invention. 
However, using the techniques outlined below, HA proteins from oxh^i Haemophilus 
influenzae strains, or from other bacterial species such as Neisseria spp. or 
Bordetalla spp. may also be obtained. 

20 Three HA proteins. HAl. HA2 and HA3, are depicted in Figures 2. 3 and 15. 

respectively. HA2 is associated with the fomiation of surface fibrils, which are 
involved in adhesion to various host cells. HAl has also been implicated in adhesion 
to a similar set of host cells. When the HAl or HA2 nucleic acid is expressed in 
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•„of£ co/i as described below, the £. co/i acquire the ability 
. non-adherentstra. of E.^ as ^^^^^ ^^^^ 

to adhere to human host cells. It 

referred to as hia '"y^-nz. adherence) and HA2 

{Haemophilus surface fibrils). 

1 .«,c A HAnucleicacidorHAprotein 
is initially identified by s ^ 3 i4orl5. Such homology can 

"r.rrr:2.»r.n;r»....^.--" 

BLASTX program (Altschul et al _,,esto be aligned. As 

aUgnmentmayincludetheintroductionofga^s^^^^^^^^^^ 

„otedbelow.inthecomparisonoproteuisc.d^^^^^^^^ 

HASwith HA2.thehomologyisdetemunedonthebasisot 



25 



sequence. 



e„dHA3 pro«,nsd^ict=din F,gores4 = and 1. 
50aminoacidsisv,nua,.yid.«ica.a3be,weenHA. and HA 
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and as between either HAl or HA3 and HA2 is 74%. As shown in Figure 1 K the 
first 24 amino acids of the N-terminusof HAl and HA2 has Hmited homology to 
several other proteins, but this homology is 50% or less. Thus, a HA protein may 
be defined as having homology to the N-terminal region of at least about 60%, 
5 preferably at least about 70%. and most preferably at least about 80%. with 

homology as high as 90 or 95% especially preferred. Similarly, the C-ierminal 
region of at least about 75, preferably 100 and most preferably 125 amino acid 
residues is also highly homologous and can be used to identify a HA protein. As 
shown in Figure 16. the homology between the C-terminal 120 or so amino acids 

10 of HAl and HA3 is about 98%. and as between either HAl or HA3 and HA2 is 

also about 98%. Thus homology at the C-terminus is a particulariy useful way of 
identifying a HA protein. Accordingly, a HA protein can be defined as having 
homology to the C-terminal region of at least about 60%, preferably at least about 
70%. and most preferably at least about 80%. with homology as high as 90 or 95% 

15 especially preferred. In a preferred embodiment, the HA protein has homology 

to both the N- and C-terminal regions. 

In addition, a HA protein may be identified as containing at least one stretch of 
amino acid homology found at least in the HAl and HA2 proteins as depicted in 
Figure 4. HA2 contains three separate stretchs of amino acids ( 1 74 to 608. 847 
20 to 1291. and 1476 to 1914. respectively) that shows significant homology to the 

region of HAl defined by amino acids 221 to 658. 

The HA proteins of the present invention have limited homology to the high 
molecular weight protein- 1 (HMWl)of//. /n/7wertrae. as well as the AIDA-I adhesin 
of £. colL For the HM W 1 protein, this homology is greatest between residues 60- 
25 540 of the HAl protein and residues 1 100 to about 1550 of HMWl. with 20% 

homology in this overlap region. For the AIDA-I protein, there is a roughly 50% 
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Ho.c,o,ybe,w.„,hefi..30™inoacMsofA,DA..a.dHA..a„d*eov=«« 
homology betwren the proteins is roughly 22%. 

,„ addiricn. *e HA,, HA2 and HA3 pro«ins of present invenUon have 
homo.ogyroeacho.her.assho«ninPigures4, 3and,5. A--^ 
„A2.*e homology is 8,% sintiiariry and 72% identity overan. HA^andHA 
^e51%idenaca,and65%similar. Thus. forthepurposesof the ,nvent,on.HAl. 

HA2 and HAS are all HA proteins. 

An "HAl ■• protein is deHned by substantial homology to the sequence shovvn in 
Pigure 2. This homology is preferably greater than about 60%, more preferably 
,ler than about 70% and most preferably greater than 80%. In preferred 
Lbodime„tsthehomologywil.beashighasabout,0.o95or98%.S.m.,ar,y. 

^.•HA2"pro.einmay be deflnedby the same substanUal homology tothe sequence 
showninFigure3.anda-HA3-proteinis denned wi.hreferenc=tongur=...as 

defined above. 
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inaddition-forsequenceswhichcontaineithermoreorfeweraminoacdsthanthe 
p„teins Shown in Figures 2, 3 and ,5, it ,s understood that the percentage 
homology vil, be determined based on the number of homologous amtno acds 
nrelatilto thetou. number of aminoacids. Thus, for example,homo,ogyo 

:quencesshorterthanthatshowninPigures2.3and,5,asdiscuss=dbe,o«-.w,l. 
he determined using the number of amino acids in the shorter sequence. 

HAproteinsofthepresentinventionmaybeshorterthantheaminoacidsequenc^ 

shol,nFigures2. 3 and ,5. Thus, ma preferred embodiment. tnCuded wthm 
aefinltioLfHAprote-nsareporttonsorfragmentsofthe sequence showntn 

Figures2,3andl5. Generally.theHAproteinfragmentsmay range tns^r^m 
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amino acids being preferred, and from about 1 00 to about 650 amino acids also 
preferred. Particularly preferred fragments are sequences unique to HA; these 
sequences have particular use in cloning HA proteins from other organisms, to 
generate antibodies specific to HA proteins, or for particular use as a vaccine. 
5 Unique sequences are easily identified by those skilled in the art after examination 

of the HA protein sequence and comparison to other proteins; for example, by 
examination of the sequence alignment shown in Figures 5 and 1 6. Preferred unique 
sequences include the N-terminal region of the HAl, HA2 and HAS sequences, 
comprising roughly 50 amino acids and the C-terminal 120 amino acids, depicted 

10 in Figures 2, 3 and 15. HA protein fragments which are included within the 

defmitionof a HA protein includeN- or C-terminal truncationsand deletions which 
still allow the protein to be biologically active; for example, which still allow 
adherence, as described below. In addition, when the HA protein is to be used to 
generate antibodies, for example as a vaccine, the HA protein must share at least 

15 one epitope or determinant with the sequences shown in Figures 2, 3 and 15. In 

a preferred embodiment, the epitope is unique to the HA protein; that is, antibodies 
generated to a unique epitope exhibit little or no cross-reactivity with other proteins. 
However, cross reactivity with other proteins does not preclude such epitopes or 
antibodies for inmiunogenic or diagnostic uses. By "epitope" or "determinant" 

20 herein is meant a portion of a protein which will generate and/or bind an antibody. 

Thus, in most instances, antibodies made to a smaller HA protein will be able to 
bind to the full length protein. 

In some embodiments, the fragment of the HA protein used to generate antibodies 
are small; thus, they may be used as haptens and coupled to protein carriers to 
25 generate antibodies, as is known in the art. 

- In addition, sequences longer than those shown in Figures 2. 3 and 15 are also 
included within the definition of HA proteins. 



BNSDOCID:<WO 9630519A1 I > 



PCTAJS96/04031 
WO 96/30519 

-14- 



10 



15 



20 



Pref=n>bly. anybodies are generated to a potion of the HA protein which is 
exposeda,theou,ermembrane.i.e.surfaceexposed.Theamino-.enninalport.o„s 

OfHAl . HA2 and HA3 are believed to be externally exposed protems. 

Ty,e HA proteins may also b« identified as associated vHth bacterial adhesion. Thus. 

deletionsof the HA proteins from the naturally occuring microorganism such as 
to».op«'»»P«ci=sr.sultsinadecr=aseorab«nceofbi„ding ability. Insome 

embodim=nts.the expression of tite HA proteins in a non-adheren. bactena such 
as £ coll results in the ability of the organism to bind to cells. 

,n the case of the nucleic acid, the overall homology of the nucleic acid sequence 
is commensurate with amino acid homology but takes into account the degenera^ 
m,hegeneticcodeandcodonbiasofdiffcremorganisms.According.y.thenucle. 

acid sequence homology may be either lower or higher than that of the pro.em 
sequence. Thus the homology of the nucleic acid sequence as compared to the 
nucleic acid sequencesof Figures 1 , 3 and 1 4 is preferably greaterthan about 40%. 
more preferably greater than about 60y. and most preferably greater than 80%. 
,„ some embodiments the homology will be as high as about 90 to 95 or 98%. 

Asoutlinedfor the pro,einsequences,ap«f«redembodim=ntuti.izes HA nucleic 
acids with substantial homology to the unique N-terminal and C-temtinal reg.ons 
of the HAl. HA2 and HA3 sequences. 

l„oneembodiment.thenucleicacidhomologyisdetenninedthroughhybrid,zaticn 
studies Thus, for example, nucleic acids which hybridize under high stringency 
to all or part of the nucleic acid sequences shown in Figures I, 3 and 14 are 

considered HA protein genes. High stringency conditions include, but are no. 

limited to. washes with O.IXSSC at 65'C for 2 hours. 
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The HA proteins and nucleic acids of the present invention are preferably 
recombinant. As used herein, "nucleic acid" may refer to either DN A or RN A. or 
molecules which contain both deoxy- and ribonucleotides. The nucleic acids include 
genomic DNA, cDN A and oligonucleotidesincluding sense and anti-sense nucleic 
5 acids. Specifically included within the definition of nucleic acid are anti-sense 

nucleic acids. An anti-sense nucleic acid will hybridize to the corresponding non- 
coding strand of the nucleic acid sequences shown in Figures 1 , 3 and 1 4, but may 
contain ribonucleotides as well as deoxyribonucleotides. Generally, anti-sense 
nucleic acids function to prevent expression of mRNA, such that a HA protein is 

1 0 not made, or made at reduced levels. The nucleic acid may be double stranded, 

single stranded, or contain portions of both double stranded or single stranded 
sequence. By the term "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro by the manipulation of nucleic acid by endonucleases. 
in a form not normally found in nature. Thus an isolated HA protein gene, in a linear 

1 5 form, or an expression vector formed in vitro by ligating DNA molecules that are 

not normally joined, are both considered recombinant for the purposes of this 
invention: i.e. the HA nucleic acid is joined to other than the naturally occurring 
Haemophiluschromosomein which it is normally found. It is understood that once 
a recombinant nucleic acid is made and reintroduced into a host cell or organism. 

20 it will replicate non-recombinantly,i.e. using the iiLvivQ cellular machinery of the 

host cell rather than in vitro manipulations: however, such nucleic acids, once 
produced recombinantly.although subsequently replicatednon-recombinantly.are 
still considered recombinant for the purposes of the invention. 

Similarly, a "recombinantprotein" is a protein made using recombinant techniques. 
25 i.e. through the expression of a recombinant nucleic acid as depicted above. A 

recombinantprotein is distinguished from naturally occurring protein by at least 
one or more characteristics. For example, the protein may be isolated away from 
some or all of the proteins and compounds with which it is normally associated 
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i„ us «Ud .ype host, o, found in .he absence of U,e host ceUs 

p™.inI,hep,n*o.u— .VPuHfiedThe — 
L,i„„ofaHApro<e«,fromone organism ina<iifferentorgan.smorhos,ceH. 

, seen. ,^.h .he ^ ^ l^-;^:::ZZ.s. 

nromoter. such that the protein is made at increas 

1 lt.e..hep.^^^^ 

addition of an epitope tag or amino acid substitutions, tnsemons and dele^n. 
addition „^;,uvconsidered"recombinant",proteinsorport.ons 
Furthermore.althoughnotnormallyconsiaerea 

. J u v.iiv .wine the sequence information ot 
of proteins which are synthesized chemically, using the seq 

Figures 2. 3 and 1 5. are considered recombinant herein as well. 
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Mso included wi* *e defini.ion of HA pro.ein a« HA pro«ins fro. o*er 
organisms, which are cloned and expressed as ou.lined below. 
,„.Hecaseofa„.i-sensenucleicacids.anan.i.sensenuc.eicacidisdef.edasone 

Ich wii, hvbridize .o an or par. of .he corresponding non-cod,ng sequence of 
: !c,.nce;showninrigures,.3and,4. Genera.K.O,ehybridi..ioncond.^^ 

diesequen „<■,„,; sense hvbridiza<ion will be high s.nngency 

used for .he de.ermma.ion of ami-sense nyon 

conditions, such as O.IXSSC a. 65 -C. 

once the HA protein nucleic acid is identified, it can be Coned and if necessary^ 
Tc ...uentpartsr.omhined.oformtheentireHApro.einnucle.cac,d. one 

:tedfromLnatura.source.e.g..containedw,thi„ap.asmidorot^^^^^ 
le,cised.herefromasaUnea.nuc,eicacidsegment.ther^omb,nantHAprot=,n 
iraidcanbefurtherusedasaprobetoidentifyandtsolateotherHApro^. 
::licacids.ltcana.soheusedasa"p.cursor- nucleic acidtomalcemodtfled 

or variant HA protein nucleic acids and protetns. 
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Using the nucleic acids of the present invention which encode HA protein, a variety 
of expression vectors are made. The expression vectors may be either self- 
replicating extrachromosomal vectors or vectors which integrate into a host genome. 
Generally, these expression vectors include transcriptional and translational 
5 regulatory nucleic acid operably linked to the nucleic acid encoding the HA protein. 

"Operably linked" in this context means that the transcriptional and translational 
regulatory DNA is positioned relative to the coding sequence of the HA protein 
in such a manner that transcription is initiated. Generally, this will mean that the 
promoter and transcriptional initiation or start sequences are positioned 5* to the 

1 0 HA protein coding region. The transcriptional and translational regulatory nucleic 

acid will generally be appropriate to the host cell used to express the HA protein: 
for example, transcriptional and translational regulatory nucleic acid sequences from 
Bacillus will be used to express the HA protein in Bacillus . Numerous types of 
appropriate expression vectors, and suitable regulatory sequences are known in the 

1 5 art for a variety of host cells. 

In general, the transcriptional and translational regulatory sequences may include, 
but are not limited to, promoter sequences, leader or signal sequences, ribosomal 
binding sites, transcriptional start and stop sequences, translational start and stop 
sequences, and enhancer or activator sequences. In a preferred embodiment, the 
20 regulatory sequences include a promoter and transcriptional start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The 
promoters may be either naturally occuiring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known 
in the art. and are useful in the present invention. 

25 In addition, the expression vector may comprise additional elements. For example. 

the expression vector may have two replication systems, thus allowing it to be 
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.aimained in .wo organisms, for exa„.p.. in man^ian or insec, ccUs for 
expressionand in a procaryorichos, for cloning and ampuncation. Furrtermore. 

for imeerating expression vectors. *e expr»sion vector contains a. least one 
secuenc'e homologous to the host cell genome, and preferably two homologous 
sequences which flank tite expression consm^t. The integrating vector may be 
directedto a specific locus in the host cell by selecUngthe appropriatehomologou. 
s«,uence for inclusion in the vector. Constructs for integrating vectors are well 
knoNvn in the art. 

,„addition.inapreferredembodiment.,heexpressionvectorcontainsaselec,abte 
marker gene to allowthe selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The HA proteins of the presem invention are produced by culturing a host cell 
transformed with an expression vector conuining nucleic acid encodmg a HA 
protein, under the appropriate conditionsto induce or cause expression of the HA 
protein. The conditions appropriate for HA protein expression will vary w,th the 
choice of theexpressionvectorand the host cell, and will be easily ascertained by 
one skilled in the art through routine experimenU.ion. For example, the use of 
constitutivepromotersin the expressionvectorwillr«,uireoptimizin,the growth 

and proliferationof the host cell, while the use of an iruiucible promoter requ.res 
dteappropriategrowthcondUionsforinduction. In addition, in some embodimenu^ 
the timing of the harvestis important. For example, the baculoviral systems used 
in insect cell expression are lytic viruses, and thus harvest time selection can be 
crucial for product yield. 

Appropriatehost cells includeyeast.bacteria.archebac.eria.fungi. and insect and 
animal cells, including mammalian cells. Of pariicular interest are 
,,,,1,^ cells, S2Sdui~™^ >'""^' " 
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subtilis . SF9 cells. C 1 29 cells. 293 cells, Neurospora, BHK, CHO, COS, and HeLa 
cells, immortalized mammalian myeloid and lymphoid cell lines. 

In a preferred embodiment, HA proteins are expressed in bacterial systems. 
Bacterial expression systems are well known in the art. 

5 A suitable bacterial promoter is any nucleic acid sequence capable of binding 

bacterial RNA polymerase and initiating the downstream (3') transcription of the 
coding sequence of HA protein into mRNA. A bacterial promoter has a transcription 
initiation region which is usually placed proximal to the 5* end of the coding 
sequence. This transcription initiation region typically includes an RNA polymerase 

10 binding site and a transcription initiation site. Sequences encoding metabolic 

pathway enzymes provide particularly useful promoter sequences. Examples include 
promoter sequences derived from sugar metabolizing enzymes, such as galactose, 
lactose and maltose, and sequences derived from biosynthetic enzymes such as 
tryptophan. Promoters from bacteriophage may also be used and are knovm in the 

15 art. In addition, synthetic promoters and hybrid promoters are also useful: for 

example, the tac promoter is a hybrid of the trp and lac promoter sequences. 
Furthemiore, a bacterial promoter can include naturally occurring promoters of non- 
bacterial origin that have the ability to bind bacterial RNA polymerase and initiate 
treinscription. 

20 

In addition to a functioning promoter sequence, an efficient ribosome binding site 
is desirable. In £. colL the ribosome binding site is called the Shine-Delgamo(SD) 
sequence and includes an initiation codon and a sequence 3-9 nucleotides in length 
located 3-11 nucleotides upstream of the initiation codon. 

25 The expression vector may also include a signal peptide sequence that provides 

for secretion of the HA protein in bacteria. The signal sequence typically encodes 
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asig™..p.p.idecompri«dofhyd™phobicaminoacid.whichai«c,*es.«=u«„ 
„f*epro:ei„fio,ne,ecU.asisweUkno»nin*ean.Thepr<,«misei*ersecr«ed 

in,o gro»* media (g^n-pcsMve bac«ria, or in.o periplastic space, .oca«d 
between *e inner and outer membrane of the cell (gram-negativc bactena). 

5 Thebacteria,exp.essionvec,ormayalsoinc,udeaselec.ablemarKergene.oa..o« 
f„r.hese.ectionofbac.=ria.strains,ha,havebeen.ransfonned.Sui.ab.ese,ecuo„ 
genesinCndegeneswhichrenderthebacteriaresistanttodrugssuchasamptCha 
ch,oramphenicoi.ery,hromycin.kanamycin,neomyci„and.etracyc,ine.Se.ectab,e 
markersalsoinc.udebiosynthe,icgenes.suchasthose,n,hehis,idine.t,yp.ophan 

10 and leucine biosynthetic pathways. 

These componentsare assembled intoexpressionvectors. Expression vectors for 
baceriaare well known in .he art. and include vectors for Boc/Hus s»Mto. E. col,. 
Sircpwcoccu! cremoris. and Streptococcus liyidcns. among others. 

The bacteria, expression vectors are transformed into bacterial host cells using 
,5 technicues well known in the ar. such as calcium chloride treatment. 

eleciroporation. and others. 

,„„„eembodimen..HAproteinsareproducedininsectcells. Expression vectors 
f„, the transformation of insect celis. and in particular, baculovims-based expresston 
vectors are well known in the art. Briefly.baculovirusisa very large DN A vtrus 
whichproducesitscoatp-oteinatvetyhighlevels. Due to dte size of the baculovtral 
genomcexogenousgenesmustbe placed in the viral genome by recombinatton. 
Accordingly.the componentsof the expression system include: a transfer vector, 
usuallv a bacterial plasmid. which conu.ns both a fragment of the baculovrtts 
genome, and a convenient restriction site for insertion of the HA protein: a wtld 
,5 ,ypebaculovintswithase,uencehomologoustothebaculov,n.s-specif,cfragmem 



20 
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in the transfer vector (this allows for the homologous recombination of the 
heterologous gene into the baculo virus genome); and appropriate insect host cells 
and growth media. 

Mammalian expression systems are also known in the art and are used in one 
embodiment. A mammalian promoter is any DNA sequence capable of binding 
mammalian RNA polymerase and initiating the downstream (3') transcription of 
a coding sequence for HA protein into mRNA. A promoter will have a transcription 
initiating region, which is usually place proximal to the 5* end of the coding 
sequence, and a TATA box. using a located 25-30 base pairs upstream of the 
transcription initiation site. The TATA box is thought to direct RNA polymerase 
II to begin RNA synthesis at the correct site. A mammalian promoter will also 
contain an upstream promoter element, typically located within 100 to 200 base 
pairs upstream of the TATA box. An upstream promoter element determines the 
rate at which transcription is initiated and can act in either orientation. Of particular 
use as mammalian promoters are the promoters from mammalian viral genes, since 
the viral genes are often highly expressed and have a broad host range. Examples 
include the SV40 early promoter, mouse mammary tumor virus LTR promoter, 
adenovirus major late promoter, and herpes simplex virus promoter. 

Typically, transcription termination and polyadeny lation sequences recognized by 
20 mammalian cells are regulatory regions located 3* to the translation stop codon and 

thus, together with the promoter elements, flank the coding sequence. The 3* 
terminus of the mature mRNA is formed by site-specific post- translationalc lea vage 
and polyadenylation. Examples of transcription terminator and polyadenlytion 
signals include those derived form SV40. 

25 The methods of introducing exogenous nucleic acid into mammalian hosts, as well 

as other hosts, is well known in the art, and will var>' with the host cell used. 
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Techniquesincludedextran-medxatedtransfc^^^^^ ^^^^^^^^^ ^^^^^ 
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of the polynucleotide(s)in liposomes, and direct mi 

nuclei. 

isomerase, glyceraldchya P ^j^, and the acid 

phosphatase gene. Yeastselect 

in the presence of copper ions. 

• av he exoressed intracellularly or secreted. The HA 
A recombinantHA protem may be expresse 

„,ade as a fusion protein to increase express-on. 

r u A proteins of the present invention are 
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protein. using cassette mutagenesis or other techniques well known in the art, to 
produce DNA encoding the variant, and thereafter expressing the DNA in 
recombinant cell culture as outlined above. However, variant HA protein fragments 
having up to about 100-150 residues may be prepared by in vUrP synthesis using 
5 established techniques. Amino acid sequence variants are characterized by the 

predetermined nature of the variation, a feature that sets them apart from naturally 
occurring allelic or interspecies variation of the HA protein amino acid sequence. 
The variants typically exhibit the same qualitative biological activity as the naturally 
occurring analogue, although variants can also be selected which have modified 
1 0 characteristics as will be more fully outlined below. 

While the site or region for introducing an amino acid sequence variation is 
predetermined, the mutation per se need not be predetermined. For example, in 
onier to optimize the performance of a mutation at a given site, random mutagenesis 
may be conducted at the target codon or region and the expressed HA protein 

1 5 variants screened for the optimal combination of desired activity. Techniques for 

making substitution mutations at predetermined sites in DNA having a known 
sequence are well known, for example. M 1 3 primer mutagenesis. Screening of the 
mutants is done using assays of HA protein activities; for example, mutated HA 
genes are placed in HA deletion strains and tested for HA activity, as disclosed 

20 herein. The creation of deletion strains, given a gene sequence, is known in the 

an. For example, nucleic acid encoding the variants may be expressed in an 
adhesion deficient strain, and the adhesion and infectivity of the variant 
Haemophilus influenzae evdAusi^d. For example, as outlined below, the variants 
may be expressed in the E. coli DH5a non-adherent strain, and the transformed 

25 £. coli strain evaluated for adherence using Chang conjunctival cells. 

* Amino acid substitutions are typically of single residues: insenions usually will 
be on the order of from about 1 to 20 amino acids, although considerably larger 
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^.derated D=l«ions range from abou.Uo 30 residues, althou^ 
i„ser,ionsmaybe.o.era,ed. D ^^^^ ^^^^ 

in some cases deletions may be much large 

domains of the HA protein is deleted. 

. . „;„^ or any combination thereofmay be used to arrive 
Substittttions, delettons. .nsemons cr any c ^ 

in certain circumstances. 

substitutions are generally made m accordan 

Chart 1 

10 rrirmr'nn'^M^^^^^tutions 

Ser 

Ala Lys 

Arg Gin. His 

Asn Glu 

1 5 Asp Ser 

Cys Asn 

Gin Asp 

Glu Pro 
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Gly Asn. Gin 

His Leu. Val 

He lie. Val 

Leu Arg. Gin, Glu 

Lys Leu. He 
Met Met. Leu, Tyr 

Phe Thr 
Ser Ser 
Thr Tyr 
Trp Trp. Phe 

Tyr He. Leu 

50 Val 
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Substantial changes in function or immunological identity are made by selecting 
substitutions that are less conservative than those shown in Chart I. For example, 
substitutions may be made which more significantly affect: the structure of the 
polypeptide backbone in the area of the alteration, for example the alpha-helical 

5 or beta-sheet structure; the charge or hydrophobicity of the molecule at the target 

site; or the bulk of the side chain. The substitutions which in general are expected 
to produce the greatest changes in the polypeptide's properties are those in which 
(a) a hydrophilic residue, e.g. seryl or threonyK is substituted for (or by) a 
hydrophobic residue, e.g. leucyl, isoleucyU phenylalanyl, valyl or alanyl; (b) a 

1 0 cysteine or proline is substituted for (or by) any other residue; (c) a residue having 

an electropositive side chain, e.g. lysyL arginyl, or histidyl, is substituted for (or 
by) an electronegative residue, e.g. glutamyl or aspartyl; or (d) a residue having 
a bulky side chain, e.g. phenylalanine, is substituted for (or by) one not having a 
side chain, e.g. glycine. 

1 5 The variants typically exhibit the same qualitative biological activity and will elicit 

the same immune response as the naturally-occurringanalogue, although variants 
also are selected to modify the characteristics of the polypeptide as needed, 
Altematively,the variant may be designed such that the biological activity of the 
HA protein is altered. For example, the Walker box ATP-binding motif may be 

20 altered or eliminated. 

In a preferred embodiment, the HA protein is purified or isolated after expression 
HA proteins may be isolated or purified in a variety of ways known to those skilled 
in the ail depending on what other components are present in the sample. Standard 
purification methods include electrophoretic. molecular, immunological and 
25 chromatographic techniques, including ion exchange, hydrophobic, affinity, and 

reverse-phase HPLC chromatography, and chromatofocusing. For example, the 
HA protein may be purified using a standard anti-HA antibody column. 
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Ulmf.tationanddiafil.mSon»:hniqu=s.mco.gmctionwiU.pro«in<»ncentiaaoa 
are also usefU. For general guida.«e in suitable puriflcarion techniques, see Scopes. 
R Protein Purification, Springer-Verhg, NY (1982). The degree of purincatton 
necessary ^«ll vary depending on the use of the HA protein. In some instances no 
5 purification will be necessary. 

Once expressed and purified if necessary, the HA proteins are useftil in a number 

of applications. 

For example, the HA proteins can be coupled, using standard technology, to affinity 
chromatographycolumns. These columns may then be used to purif>- ant.bodtes 
10 from samples obtained from animals or patients exposed to the HaemopHilus 

my7«e«z^.organism. The purified antibodiesmay then be used as outlinedbelow. 

Additionallv.the HA proteins are useful to make antibodiesto HA proteins. These 
antibodies find use in a number of applications. The antibodies are used to diagnose 
the presenceof an Haemophilus influenzae infection in a sample or patient. In a 
1 5 preferredembodimem.the antibodiesare used to detectthe presence of nontypable 

Haemophilus influenza (NTHl). although typable H influenzae infections are also 
detected using the antibodies. 

This diagnosis will be done using techniques well known in the an: for example, 
samples such as blood or tissue samples may be obtained from a patiem and tested 
,0 for reactivitv with the antibodies, for example using standard techniques such as 

ELISA m a preferred embodiment, monoclonal antibodies are generated to the 
HA protein, using techniques well known in the art. As outlined above, the 
antibodies may be generated to the full length HA protein, or a portion of the HA 
protein. 
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Antibodies generated to HA proteins may also be used in passive immunization 
treatments, as is known in the art. 

Antibodies generated to unique sequences of HA proteins may also be used to screen 
expression libraries from other organisms to find, and subsequently clone, HA 
5 nucleic acids from other organisms. 

In one embodiment, the antibodies may be directly or indirectly labelled. By 
"labelled" herein is meant a compound that has at least one element, isotope or 
chemical compound attached to enable the detection of the compound. In general 
labels fall into three classes: a) isotopic labels, which may be radioactive or heavy 
10 isotopes; b) immune labels, which may be antibodies or antigens; and c) colored 

or fluorescent dyes. The labels may be incorporated into the compound at any 
position. Thus, for example, the HA protein antibody may be labelled for detection, 
or a secondary antibody to the HA protein antibody may be created and labelled. 

In one embodiment, the antibodies generated to the HA proteins of the present 
1 5 inventionare used to purify or separate HA proteins or the Haemophilus influenzae 

organism from a sample. Thus for example, antibodies generated to HA proteins 
which will bind to the Haemophilus influenzae organism may be coupled, using 
standard technology, to affinity chromatography columns. These columns can be 
used to pull out the Haemophilus organism from environmental or tissue samples. 

20 In a preferred embodiment- the HA proteins of the present invention are used as 

vaccines for the prophylactic or therapeutic treatment of a Haemophilus influenzae 
infection in a patient. By "vaccine" or "immunogenic compositions" herein is meant 
an antigen or compound which elicits an immune response in an animal or patient 
The vaccine may be administered prophylactically, for example to a patient never 

25 previously exposed to the antigen, such thai subsequent infection by the 
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„H«us .Mn^ org^is. is prevent.. A— y. ^ ac n - 
,e ad.i„istc«a«Ucany.c a pa.ie„. previously exposed or .„fec,«i by * 

..a„i..u„erespo„seis.e„e..ed.hicha..ows*epaUen.sin«nu„=sy.^^ 
,o.o«effe«ive,ycon,ba.d«infecUon.Th«.fore,-ple,*eremaybeadecrease 

or lessening of the symptoms associated with infecuon. 
A>uien,"forthepu,posesof,hepresentinventioninc,udesbothhu.nansand^ 
animalsand organic. Thus the methods are applicable to both human therapy 
and veterinary applications. 

The administration of the HA protein as a vaccine is done in a variety of ways. 
Oenerallv. the HA pn..eins can be fonnulated according to Icnown method, to 
p„parepharmaceutical.yusefi.lcompositions. Whereby therapeutically effecttve 

amounts of the HA protein are combined in admixture with a pharmaceut.cally 
acceptablecarriervehidcSuitablevehiclesandUteirformulationare well known 
5 intheart. Such compositions will contain an effective amount of the HA protetn 

,„.c.her with a suitable amount of vehicle in order to prepare pharmaceufcally 
a;ceptab.ecompositionsforeffectiveadmi„istration.othehost.Thecompos,t,a, 

.av include salts, buffers, carrier proteins such as serum albumin, targeung 
molecules to localize the HA protein at the appropriate site or ttssue w,th,„ 
,0 organism.andothermolecules. The composition may include adjuvants as well 

,n one embodime„..the vaccine is administered as a single dose; tha, is. one do« 
is adequate to induce a sufficient immune response to prophylacttcally or 
,herapeutical.ytreata«*«»./«y;"-..infection.lnaltema.eembod,ment. 

^ Iccne is administ«ed as seve«. doses over a period of time, as a pnmary 
25 vaccination and "booster" vaccinations. 



wo 96/30519 



-29- 



PCT/US96/04031 



By "therapeutically effective amounts" herein is meant an amount of the HA protein 
which is sufficient to induce an immune response. This amount may be different 
depending on whether prophylactic or therapeutic treatment is desired. Generally, 
this ranges from about 0.001 mg to about 1 gm, with a preferred range of about 0.05 
5 to about .5 gm. These amounts may be adjusted if adjuvants are used. 

The following examples serve to more fully describe the manner of using the above- 
described invention, as well as to set forth the best modes contemplated for canying 
out various aspects of the invention. It is understood that these examples in no way 
serve to limit the true scope of this invention, but rather are presented for illustrative 
1 0 purposes. All references cited herein are specifically incorporated by reference. 

EXAMPLE 1 
Cloning ofHAl 

Many protocolsare substantially the same as those outlined in St. Geme et al.. Mol. 
Microbio. 15(l):77-85 (1995). 

15 Bacterial s trains, plasmids. and phages. 

Nontypable H. influenzae strain 1 1 was the clinical isolate chosen as a prototypic 
HMWl/HMW2-non-expressingstrain, although a variety of encapsulated typable 
strains can be used to clone the protein using the sequences of the figures. The 
organism was isolated in pure culture from the middle ear fluid of a child with acute 
20 otitis media. The strain was identified as H. influenzae by standard methods and 

was classified as nontypable by its failure to agglutinate with a panel of typing 
antiserafor H. influenzaevypesato f (Burroughs Wellcome Co.. ResearchTriangle 
Park. N.C.) and failure to show lines of precipitation with these antisera in 
counterimmunoelectrophoresis assays. Strain 1 1 adheres efficiently to Chang 
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oo„iu„c,iva.c=»« .n v/,r„. a. levels comparable .0 *ose previously den,o„sua.ed 
for n™. strains expressing HMWl/HMW2,lilce proteins (dau no, sho,™). 
convalesces serun, fron, U,e child infeced wiU, *is strain demo^trared an 
a„Ubodvresponsedirec.edpredon.ina„Uyagainstsurface.xposedlughmo,ecu,. 

weigh, prouins wi* molecular weights greater than 100 kDa. 

M13mplSandMnmpl9wereobtainedfromNewE„glandBi„Ubs.,nc.»^ 
Mass ,pT7.7wasthekindgiftofStanleyTabor. Thisvectorcon.a.nstheT7RNA 
polvmerase promoter <,10. a Hhosome-bindingsite. and the — ^ -n s.,e 
for'the T7 gene 1 0 protein upstream from a multiple donmg s,te. 

nul l fii lar -hir '-i T •"•< ■i"lw-'™'»«' 

The r^ombinant phage containing the HA , gene was isolated and characteri«d 
usingmethodssimilarto those describedpreviously. In brietchromosoma^DNA 
Js.ramUwasprepar=dandS<,„..partialres,rictiondigestsonheDNAwe. 

pupated and fractionated on 0.7% agarose gels. Fractions contatnmg DNA 
, ra.ments in the 9- to 20- Kbp range were pooled, and a hbrary ^^'"'^^ 

,.;a.ion ,n.o .EMBL3 arms. Ligation m,x.u.s were packaged ~* 

Cgapack <S„e> and p.a,e-amplir,ed 

Lambdaplaqueimmunologicalscreemngwasperfomtedasdescnb 

„a.MoUarC.oning:ALabora,o,>.Ma„ual.2dEd.a9S„.Co,dSpnngHarbor 

,„ P„,; For plasmid subcloning studies. DNA from recombinant phage w^ 
>0 rress. r p , -j t-i -T Sttmdardmeihodswere used for 

subclonedin,otheT7expressionplasm,dpT7.7. Standardmet 

„,anipulation of cloned DNA as described by Maniatis e, al (supra). 

Plasmid PHMW8-3 was generared by isola,ing an 1 1 kbp Xbal ^gmen, fi.m 
p^fiedDNAftomrecombinantphageclone n.l7andhga.ing.n.oXbalcutpT - 

;LidpHMWS.4 was generated by isolattng a ,0 kbp B..H.-C.1 cut pT7-7. 
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Plasmid pHMW8-5 was generated by digesting plasmid pHMW8-3 DN A with ClaL 
isolating the larger fragment and religating. Plasmid pHM W8-6 was generated by 
digesting pHMW8-4 with SpeU which cuts at a unique site within the HA J gene, 
blunt-ending the resulting fragment, inserting a kanamycin resistance cassene into 
5 ihc Spe\ site. PlasmidpHMW8-7wasgeneratedby digesting pHMW8-3 with Arwl 

and HindllL isolating the fragment containing pT7-7, blunt-ending and religating. 
The plasmid restriction maps are shown in Figure 6. 

DNA sequence analysis. 

DNA sequence analysis was performed by the dideoxy method with the U.S. 
10 Biochemicals Sequenase kit as suggested by the manufacturer. [^*S]dATP was 

purchased from New England Nuclear (Bostoa Mass). Data were analyzed with 
Compugene software and the Genetics Computer Group program from the 
University of Wisconsin on a Digital VAX 8530 computer. Several 21-mer 
oligonucleotide primers were generated as necessary to complete the sequence. 

15 A^ber^n^f as$^ys> 

Adherence assays were done with Chang epithelial cells [Wong-Kilboume 
derivative, clone 1 -5c-4 (human conjunctiva), ATCC CCL20.2)]. which were seeded 
into wells of 24-well tissue culture plates, as described (St. Geme III et al.. Infect. 
Immun. 58:4036 ( 1 990)). Bacteria were inoculated into broth and allowed to grow 

20 to a density of approximately 2 x 1 0^ colony-forming units per ml. ApproximateK 

2x10^ colony-forming units were inoculated onto epithelial cells monolayers, and 
plates were gently centrifuged at 165 x g for 5 min to facilitate contact between 
bacteria £ind the epithelial surface. After incubation for 30 min at 37'' C in 5% CO., 
monolayers were rinsed five times with phosphate buffered saline (PBS) to remove 

25 nonadherent organisms and were treated with tr\'psin-EDTA(0.05% tr>'psin/0.5% 
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EDTA) in PBS to release them from the plastic support. Well contents were 
agitated, and dilution were plated on solidmediumto yield the number of adherent 
bacteria per monolayer. Percent adherence was calculated by dividing the number 
of adherent colony-forming units per monolayer by the number of inoculated colony- 
5 forming imits. 

»^^,^ ^^ » n .h,r«ct^riz«tionof recoiiibinmitfthar«? gwrf^^inrthg ^tnin 11 

lllig fT ipniecular wypght adhesioH Prottill. 

The nontypable Haemophilus influenzae strain 1 1 chromosomal DN A library was 
screened immunologically with convalescent serum from the child infected with 
10 strain 1 1 . Immunoreactive clones were screened by Western blot for expression 

ofhigh molecular weightproteins with apparemmolecularweights> 100 dDaand 

two different classes of recombinant clones were recovered. A single clone 
designated 1 1 -1 7 was recovered which expressed the HAl protein. The recombinant 
protein expressed by this clone had an apparem molecular weight of greater than 
15 200 kDa. 

TT^n^formation into E. coli 

Plasmids were introduced into DH5a strain of E. coli (Maniaiis, supra), which is 
a non-adherent strain, using electroporation (Dower et al., Nucl. Acids Res. 16:6127 
(1988). The results are shown in Table 1 . 
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Strain 


% Adherence' 


DH5a(pHMW 8-4) 


43.3 ± 5.0% 


DH5a(pHMW 8-5) 


41.3 ±3.3% 


DH5a(pHMW 8-6) 


0.6 ± 0.3% 


DH5a(pHMW 8-7) 




DH5a(pT7-7) 


0.4 ±0.1% 



'Adherence was measured in a 30 minute assay and was calculated by dividing the 
number of adherent bacteria by the number of inoculated bacteria. Values are the 
1 0 mean ± SEM of measurements made in triplicate from a represeniativeexperiment 

In addition, a monoclonal antibody made by standard procedures, directed against 
the strain 1 1 protein recognized proteins in 57 of 60 epidemiologically-unrelaled 
NTHI. However. Southern analysis using the gene indicated that roughly only 25% 
of the tested strains actually hybridized to the gene (data not shown). 

15 EXAMPLE 2 

Cloning ofHA2 

In a recent study we examined a series of H. influenza type b isolates by 
transmission electron microscopy and visualized short, thin surface fibrils distinct 
from pili (St. Geme, J.W.IIL and D. Cutter. 1995. Evidence that surface fibrils 
20 expressed by Haemophilus influenzae type b promote attachment to human epithelial 

cells. Mol. Microbiol. 15:77-85.)- In that study, the large genetic locus involved 
in the expression of these appendages was isolated. 

Bacterial strains and plasmids 
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10 



15 



20 



25 



„. influenzae strain C54 is a type b strain that has been described previously 
(Pichichero. M.E.. P. Anderson. M. Loeb. and D.H. Smith. 1982. Do piH play a 
role in pathogenicity Haemophilus influenzae type b^ Lancet. ii:960-962.). Stnun 
C54-Tn400 23 is a mutant that contains a mini-TnyO kan element in the hsfXocns 
and demonstrates mir^imal .Uro adherence (St. Geme. J.W.lIl. and D. Cutter. 
1995 Evidence that surface fibrils expressed by Haemophilus influenzae type b 
promoteattachmenttohumanepithelialceUs.Mol.Microbiol.l5:77-85.). Strains 
,053 1058 1060. 1063. 1065. 1069, 1070, 1076, 108Landl084areH.m/Z«e«z«e 
tvpe b isolates generously provided by J. Musser (Baylor University. Houston. 
Texas) (Musser et al., 1990. Global genetic structure and molecular ep.dem.ology 
of encapsulated Haemophilus influenzae. Rev. Infect. Dis. 12:75-111.). H 
influenzae strains SM4 (type a). SM6 (type d). SM7 (type e). and SM72 (type c) 
are type strains obtained from R. Facklam at the Centers for Disease Control 
(Atlanta.Georgia). Strains 142. 327. and 351 are//. ,„/Z«en.-«. type e isolates. and 
strains 134 219, 256. and 501 are H influenzae type f isolates obtained from H. 
Kavhtv (FimiishNational Public Health Institute. Helsinki). StrainRd (type d) and 
the 15 nontvpable isolates examined by Southern analysts have been described 
previously (Alexander et al.. J. Exp. Med. 83:345-359 (1951 ): Barencamp et al.. 
infect Immun. 60:1302-1313(1992)). E. coli DH5a is a nonadherent laboratory 
strainthatwasoriginallyobtainedfromGibcoBRL. £. co/, strain BL21(DE3) was 
a gift from F.W. Studier and contains a single copy of the T7 RNA polymerase gene 
under the control of the regulatory- system (Studier. F.W.. and B.A. Moffatt. 
1986 Use of bacteriophage T7 RNA polymerase to direct high-level expression 
of cloned genes. J. Mol. Biol. 189: 1 13-130.). Plasmid pT7-7 was provided by S. 
TaborandcontainstheT7RNApolymerasepromoterfl0.aribosome.bindmgsue. 

and the translationalstartsite for the T7genel0prote.n upstream fromamultiple 

cloning site (Tabor. S.. and Cr. Richardson. 1985. A bacteriophage T7 RNA 
polymerase/promotersystemforcontrolledexclusiveexpressionofspecificgenes. 

Proc Natl Acad. Sci. USA. 82:1074-1078.). pUC19 isahigh-copy-numberplasmid 
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that has been previously described ( Yanish-Perronet ah. Gene 33 : 1 03- 1 1 9 ( 1 985)). 
pDC400 is a pUCl 9 derivative that harbors the K influenzae strain C54 surface 
fibril locus and is sufficient to promote in vitro adherence by laboratory strains of 
£. coli (St. Geme, J.W.IIL and D. Cutter. 1995, Evidence that surface fibrils 
5 expressed by Haemophilus influenzae type b promote attachment to human epithelial 

cells. Mol. Microbiol. 15:77-85.). pHMW8-5 is a pT7-7 derivative that contains 
the H. influenzae strain 1 1 hia locus and also promotes adherence by nonadherent 
laborator\' strains of £. co//(Barenkamp,S.J.,and J.W. St. GemeJII. Identification 
of a second family of high molecular weight adhesion proteins expressed by 

10 nontypable Haemophilus influenzae. Mol, Microbiol., in press.). pHMW8-6 

contains the //. influenzae hia locus interrupted by a kanamycin cassette 
(Barenkamp.S.J., and J.W. St. Geme. III. Identification of a second family of high 
molecular weight adhesion proteins expressed by nontypable Haemophilus 
influenzae, Mol. Microbiol., in press.). pUC4K served as the source of the 

1 5 kanamycin-resistancegene that was used as a probe in Southern analysis (Vieira, 

J., and J. Messing. 1982. The pUC plasmids, an M13mp7-derived system for 
insenion mutagenesis and sequencing with synthetic universal primers. Gene. 
19:259-268.). 

Culture conditions 

20 H, influenzae strains were grown on chocolate agar supplemented with 1% Isovitale 

X. on brain heart infusion agar supplemented with hemin and NAD (BHI-DB agar), 
or in brain heart infusion broth supplemented with hemin and NAD (BHIs) 
(Anderson. P., R.B. JohnstonJr.. and D.H. Smith. 1972. Human serum activity 
di^d\v\s\ Haemophilus influenzae xypt b. J. Clin. Invest. 51:31-38.). These strains 

25 were stored at -80<: in brain heart infusion broth with 25% glycerol. £. coli strains 

were grouTi on Luria Bertani (LB) agar or in LB broth and were stored at -80**C 
in LB broth with 50% glycerol. For H. influenzae, kanamycin was used in a 
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c„„ccmr.,io„ of 25 mg/ml. Amibictic cc„=cn«a.ions for E. coU induded *e 
following: an,pkillin or carbenicillin 100 mg/ml and kanamycin 50 mg/ml. 

Induclion of plasmid-encoded proteins 

Toidcnufyplasmid.«Kodedprotdns.tf,= baaeriophageT7 expression vec.orpT7-7 

was employed and the relevant pT7.7 derivatives were transformed mto £ col. 
BL-1(DE3) ActivationoftheT7promo.erwasachievedbyinducingexpress.oo 
of T7 RNA polvmerase with isopropyl-b-D-thtogalaCopyranoside (final 
concentratio„.lmM). After inductionfor 30 minutesat37^.ri6mpicinwasadded 

,„ a final concentration of 200 mg/ml. Thirty minutes later. 1 ml of culture was 
pulsed with50mCiof.rans-P'Sl.label(lCN.lrvine.Calif)for5minutes.Bac.ena 

were harvested, and whole cell lysates were resuspended in Laemml, buffer for 
analysis by sodium dodecyl sulfate-polyacrylamide gel electrophoresis on 7.5% 
acrylamide gels (Laemmli.U.K. 1970. Cleavage of structural proteins dunng the 

assembly of Ute head of bacteriophage T4. Nature (London). 227:680-685.). 
Autoradiography was performed with Kodak XAR-5 f,lm. 

Recombinant DNA methods 

DNA Ugations.restrictionendonuclease digestions, and ge, electrophoresis were 

perfonned according to standard techniques (SambrooU. i.. E.F. Fritsch. a^d T. 
Maniatis.l989.Molecularcloning:alabo.atorymanual.2nded.Co,dSpringHarbor 

Laboratorv. Cold Spring Harbor. N.Y.). Plasmids were introduced tnto £. co/, 
strainsbv either ch=mica,t«nsfonna.ionorelectropora,ion.asdescribed(Dower. 

W J J F Miller, and C.W. Ragsdale. 1988. High effciency transformatton of £ 
„„-bvhighvoltageelee,roporation.NucleicAcidsRes..6:6127-6l45..Sambroo. 

, E F Fritsch and T. Maniatis. 1989. Molecular cloning: a laboratorv- manual. 
,ndedColdSpnngHarborUborator>.ColdSpnngHarbor.N.Y.).T,ansfom.tion 

- ; H <„»wasperformedusingthe MIV methodof Herriottet al. (Herr,o«. 
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R.M.. E.M. Meyer, and M. Vogt. 1970. Defined nongrowth media for stage II 
competence in Haemophilus influenzae. J. Bacteriol. 101:517-524.). 

Adherence assays 

Adherence assays were performed with tissue culture cells which were seeded into 
5 wells of 24-weU tissue culture plates as previously described (St. Geme et al.. Infect. 

Immun. 58:4036-4044( 1 991 )). Adherence was measured after incubating bacteria 
with epithelial monolayers for 30 minutes as described (St. Geme, J.W.IIL S. 
Falkow,and S.J. Barenkamp. 1993. High-molecular- weightproteins of nontypable 
Haemophilus influenzae mediate attachment to himian epithelial cells. Proc. Natl. 

10 Acad. Sci. U.S.A. 90:2875-2879.). Tissue culture cells included Chang epithelial 

cells (Wong-Kilboumederivative,clone l-5c-4 (human conjunctiva))(ATCCCCL 
20.2), KB cells (human oral epidermoid carcinoma) (ATCC CCL 1 7), HEp-2 cells 
(human laryngeal epidermoid carcinoma) (ATCC CCL 23), A549 cells (human 
lung carcinoma) (ATCC CCL 1 85), Intestine 407 cells (human embryonic intestine) 

1 5 (ATCC CCL 6), HeLa cells (human cervical epitheloid carcinoma) (ATCC CCL 

2 ). ME- 1 80 cells (human cer\'ical epidermoid carcinoma) (ATCC HTB 33). HEC-IB 
cells (human endometrium) (ATCC HTB 1 13). and CHO-Kl cells (Chinese hamster 
ovar>' ) (.ATCC CCL 61 ). Chang. KB, Intestine 407, HeLa. and HEC-IB cells were 
maintained in modified Eagle medium with Earle's salts and non-essential amino 

20 acids. HEp-2 cells were maintained in Dulbecco*s modified Eagle medium. A549 

cells and CHO-Kl cells in FI2 medium (Ham), and ME- 180 cells in McCoy 5 A 
medium. All media were supplemented with 10% heat-inactivated fetal bovine 
serum. 

Southern analysis 

25 Southern blotting was performed using high stringency conditions as previously 

described (St. Geme, J.W.IILand S. Falkow. 1991. Loss of capsule expression by 
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Haemophilus influenzae type b results in enhanced adherence to and invasion of 
human cells. Infect. Immun. 59:1325-1333.). 



Microscopy 

Sa,np,=spfepi*=Ua.c.Uswi*associa.=dbac,eriaweres.in=dw,U,0,.msas.,n 
a„aex™inedbyHgh,m™yasdcscribed,S.Ge.c.J.W.Il..andSTa..ow^ 
S. ,990. H«mo;.«/«..>.><^-»'adher=s.o »,d enKrs cultured human =p.a,ehal 
cells. Infect. Immun. 58:4036-4044.). 

For „ega.ive-su.ning=tec,ronmicro«opy.b,«criaw«e rained vvi,h0.5-/. aqueous 
uranvUceu«e(S,.Q=me.J.W.nl.a„dS.Fa.kow..991.Losscfcapsu.eexpress,c. 

,v >.fluen.ae^y^ b results in enhanced adherence .o and invas.on 

of human cells. Infec Immun. 59:1325-1333.) and examined using a Ze,ss lOA 

microscope. 

The previous smdy indicated that laboratory E. coU strains harboring the plasmid 
PDC400 were capable of efficient attachment to cultured human epithelial cells 
,S, Oeme J.W.Ul. and D. Cutter. 1995. Evidence that surface fibrils expressed 
bv Hacn.opHHus ,>,/7»e«=<...ype b promote anachmem to human cpithehal cells. 
Mol. Microbiol. .5:77-85... Subcloning stud.es and transposon mutagenests 
mdicatedtha,therel.vantcodingregionofpDC400 was presem within an 8.. kb 

AMfragmen.(St.Oeme.i.W.ni.andD.Cu„er.l995.Evidencethatsurfacefibn. 
e.p.ssedbv«MV«/n/,.««»»m«bpromoteanachmenttohumanep,*^^^^ 
cells Mol. Microbiol. 15.77-85.) (Figure 7). To confirm this conclusion, m the 
preset studv this X,a^ fragment was subcloned into pT7-7. generating p.asmtds 
designa.edpDC60.andpDC60:.whichcon.ainedtheins=ninopposi.eorien,a„ons 

.Figure 7). As predicted, expression of these plasmids in £. co, DH5a was 
associated with a capacity for high level « v/,ro anachment (Table 1 ). 
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Table 1. Adherence to Chang conjunctival cells. 



Sirain 



ADHERENCE f% inoculumV 



DH5a/pT7-7 
DH5a/pDC400 
DH5a/pDC601 
DH5a/pDC602 



0.4 ±0.1 



25.3 ± 1.2 



54.3 ±7.5 



55.5 ±4.3 



C54bp- 

C54-HAl::kan'' 



98.7 ±9.5 



1.5 ±0.2 



C54-Tn400.23' 



3.3 ±0.4 



10 



15 



20 



'Adherence was measured in a 30 minute assay and was calculated by dividing the 
number of adherent bacteria by the number of inoculated bacteria. Values are the 
mean ± SEM of measurements made in triplicate from representativeexperiments. 
''Strain C54-HA1 ::kan was constructed by transforming C54bp" with linearized 
pHMW8-6. which contains the HA 1 gene with an intragenic kanamycin cassette. 
'Strain C54-Tn400.23 contains a mini-Tn70*on element in the hsf\oc\xs (St. Geme 
eta!.. Mol. Microbiol. 15:77-85 (1995)). 

To determine the direction of transcription and identify plasmid-encoded proteins. 
pDC601 and pDC602 were subsequently introduced into £. coli BL21(DE3). 
producing BL21(DE3)/pDC601 and BL21(DE3)/pDC602, respectively. As a 
negative control, pT7-7 was also transformed into BL2 1 (DE3). The T7 promoter 
in these three strains was induced with IPTG. and induced proteins were detected 
using trans- p'S]-label. As shown in Figure 8, induction of BL21(DE3)/pDC601 
resulted in expression of a large protein over 200 kDa in size along with several 
slightly smaller proteins, which presumably represent degradation products. In 
contrast, when BL21(DE3)/pDC602 and BL21(DE3)/pT7-7 were induced, there 
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was no expression of *ese pro.etas. This expen^em indicated ^ *e geneUc 
„,.,eria, confined in ,he 8.3 Ub ASd fragment is .ranscribed fron, left ,0 ngh. as 
shown in Figure 7 and suggested that a single long open reading frame may be 



present. 



3 Nucleotide sequencing 

Nucleotide sequence was detemtined using a Se,uenase Wt and doubie-su^ded 
p,asmid,emplate.DNAfragm««swere»bclon«iin.opUC19andse,uencedalo^ 

ho.hstrandsbyprimerwalking.DNAse<,uenceana,ysiswasperforn,edus.ng.he 
Genetics Computer Group (GCG) software package from the Universtty of 
0 Wisconsin (Devereux. P. Haeberli. and O. Smithies. 19^. A compreh.ns.ve 

setofse<,uenceanalysisprogran,sfortheVAX.NucleicAc,dsRes.l2..87.39^> 

sequence similarity searches were carried out using the BLAST P"^-> 
Nation^ Center for Biotechnology Infomtation (Altschul. S.F.. W. Gish. W. M,te 
E.W. Myers, and D.J. Lipman. 1990. Basis local alignment search tool. J. Mol. Btol. 
15 215:403^410.). 

Sequencing of the 8.3 Vtb XBal fragment revealed a 7059 bp gene, wh.ch is 
desianated for literature purposes as Hsf fo, a.e6™opW/« .urface fibrils, and ,s 
.efe"rred to herein as HA2. This gene encodes a 2353-amino acid polypepttd. 
^ferredtoasHsf orHA2. with a calculated molecular mass of 243.8 IcDa. whtch 
,0 is stmilar in si« to Ute observed protein species detected after inducon of 

BL^l(DE3VpDC60.. The gene has a GC content of42.8%. somewhat greater 
tha^ the publishedes,imateof38-39%for the whole ge„ome(Fleischn,a„netal.. 

,,,5Whole.genomerandomsequencingandassemblyofH.™op«/..m/!-"- 

Rd. science. 269: 496-512.. Ki.ian. M. 1976. A ta.xonomic -"^ jf <'«^-- 
Hoe^opWus. wHth proposal of a new species. >. Gen. Microbiol. 93.9-6.... A 
p„.ativeHbosoma,bindingsitew,ththese,uenceAAOGTAbegmsl.^e,.,r 

upstreamof the pressed initiatio„codon.Asequences,milartoa..o-,ndepende„t 
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transcription tenminator is present beginning 20 nucleotides beyond the stop codon 
and contains interrupted inverted repeals with the potential for forming a hairpin 
structure containing a loop of two bases and a stem of 1 1 bases. Of note^ a string 
of 29 thymines spans the region from 149 to 121 nucleotides upstream of HA2. 

5 Homology to HAl/HAl 

The nontypable H, influenzae nonpilus protein HAl protein (called Hia in the 
literature) promotes attachment to cultured human epithelial cells as outlined above. 
Comparison of the predicted amino acid sequence of HA 2 and the sequence of HAl 
revealed 8 1 % similarity and 72% identity overall. As depicted in Figure 5, the two 

1 0 sequences are highly conserved at their N-terminal and C-terminal ends, and both 

contain a Walker box nucleotide-bindingmotif Interestingly. HA 1 is encoded by 
a 3.2 kb gene and is only 1 15-kDa. In this context, it is noteworthy that three 
separate stretches of HA2 (corresponding to amino acids 1 74 to 608. 847 to 1291, 
and 1476 to 1914, respectively) show significant homology to the region of HAl 

15 defined by amino acids 221 to 658 (Figure 5). Table 2 summarizes the level of 

similarity and identity between these three stretches of HA2 and one another. The 
suggestion is that the larger size of HA2 may relate in part to the presence of a 
repeated domain which is present in single copy in HAl. 

Table 2. Percent similarity and percent identity between HA2 repeats. 

20 Percent Similarity/Percent Identity 

HA2 1 74.608=* HA2 847-1291» HA21476^1914=» 

HA2 174-608 * 65/53 76/60 

HA2 847-1291 * 70/56 

HA2 1476-1914 * 

25 - "Numbers correspond to amino acid residue positions in the full-length HA2 (Hsf) 
protein. 
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To evaluate whether HA } and HA2 are alleles of the same locus, a series of Southern 
blots were performed. Samples ofchromosomalDNA from strains C54 and 1 1 were 
subjected to digestion with BgHh Cla\ and either Pst\ or Xbal. Resulting DNA 
fragments were separated by agarose electrophoresis and transferred bidirectionally 
to nitrocellulose membranes. One membrane was probed with a 3.3 kb internal 
fragment of the HA2 gene (Figure 7), and the other membrane was probed with a 
1.6 kb intragenic fragment of the HAl gene. As shown in Figure 9. both probes 
recognized exactly the same chromosomal fragments. 

To obtain additional evidence that the HA2 and HAJ genes are homologs. the 
inactivation of HA2 by transformation of H. influenzae strain C54bp with 
insertionallyinactivated/w; was attempted. TheplasmidpHMW8-6(Barenkamp. 
S.J.. and J.W. St. Geme. III. Identification of a second family of high molecular weight 
adhesion proteins expressed by nontypable Haemophilus influenzae. Mol. Microbiol., 
in press.), which contains the HA 1 gene with an intragenic kanamycin cassette, was 
linearized with Ndel and introduced into competent C54. Southern hybridization 
confirmed insertion of the kanamycin cassette into HA2 (not shown). Furthermore 
examination of the C54 mutant by negative staining transmission electron microscopy 
revealed the loss of surface fibrils (not shown). Consistent with these findings, the 
mutant strain demonstrated minimal attachmemto Chang conjunctival cells (Table 



20 1). 



In additional experiments, the cellular binding specificities conferred by the HA2 
and HAl proteins were compared. As shown in Figure 10. DH5a/pDC601 
(expressing HA2) demonstrated high level attachment to Chang cells, KB cells. HeLa 
cells, and lntestine407 cells, moderate level attachment to HEp-2 cells, and minimal 
attachment to HEC-IB cells. ME-180 cells, and CHO-Kl cells. DH5a harboring 
PHMW8.5 (expressing HAl) showed virtually the same pattern of attachment. 
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Giemsa staining and subsequent examination by light microscopy confimied these 
viable coimt adherence assay results. 

Homology to other bacterial extracellular proteins 

A protein sequence similarity search was performed with the HA2 predicted amino 
5 . acid sequence using the BLAST network service of the National Center for 
Biotechnology Infomiation (AltschuL S.F., W. Gish, W. Miller. E.W. Myers, and 
DJ. Lipman. 1990. Basis local alignment search tool. J. Mol. Biol. 215:403-410.). 
This search revealed low-level sequence similarity to a series of other bacterial 
adherence factors, including HMWl and HMW2 (the proteins previously identified 

10 as being important adhesins in HA 1 -deficient nontypable//. influeniae siTBins: (St. 

Geme, J. WJIL S. Falkow, and S J. Barenkamp. 1 993 . High-molecular-weight proteins 
of nontypable//aemop/i//i/5" influenzae mediate attachment to human epithelial cells. 
Proc. Natl. Acad. Sci. U.S.A. 90:2875-2879.), AIDA-I (an adhesion protein expressed 
by some diarrheagenic £. coli strains: Benz, I., and M.A. Schmidt. 1992. AIDA-L 

1 5 the adhesin involved in diffuse adherence of the diarrhoeagenic Escherichia coli strain 

2787 (0126:H27), is synthesized via a precursor molecule. Mol. Microbiol. 
6: 1 539-1 546.), and Tsh (a hemagglutinin produced by an avian pathogenic E. coli 
strain: Provence, D. and R. CurtissIII. 1994. Isolation and characterizationof a gene 
involved in hemagglutination by an avian pathogenic Escherichia coli strain. Infect 

20 Immun. 62:1369-1380.). In addition. HA2 showed homology to SepA. a Shigella 

flexneri secreted protein that appears to play a role in tissue invasion 
(Benjelloun-Touimi. Z.. P.J. Sansoneui, and C. Parsot. 1995. SepA, the major 
extracellular protein of Shigella flexnert autonomous secretion and involvement in 
tissue invasion. Mol. Microbiol. 17:123-135.). Alignment of HA2 with HMWl. 

25 HMW2, AIDA-L Tsh. and SepA revealed a highly conserved N-terminal domain 

(Figure 1 1). In AlDA-I.Tsh.and SepA. this N-terminal extremity precedes a typical 
procaryotic signal sequence (Benjelloun-Touimi.Z.. P.J. Sansonetti. and C. Parsot. 
1 995. SepA. the major extracellular protein of Shigella flexneri: autonomous secretion 
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^ ^.oU^r in .i^uc invasion. MC. Mioobiol. ,7:03.135., Su^^y. « HA^^ 
conse^ed a.n,.n pr-edes a 26 an,inc acid «^en. ,ha. is ch— ^ 
a p„si,i«,ychar.=dregion,foUo«edby asuingof hydrophob.cres.dues. and ,hen 

alanine-glutamine-alanine. 

Pres«.c.of.n»42hom.k«ia..her.ncp«-aud.nd.o,.enc.psub.cds,rains 

p.v.ousworMen,ons,n.«d*a.an™.honK>.ogisp«senU„H.>.^«.- 
. strains M« and Eagan ,S.. Genre, :.W..U. and D. Cuner, ,995. Ev.d=nce ^ 
surface f,bri.se.pressedhy H.e^p'..'- 'XP' » P""- ' " 

LnanepidreUa, ceiis. Mo,. Microhio,. ,5:77-S5.,. To define *e e«en, to wh,ch 
rHeH..oc«sissharedbyo*er.ypebs,rai„s.apane.ofevo,u.io„ar..yd,verse,Vpe 

. isoures by Sou*em anaiysis were examined. An,ong these s«ins were s,x 
belonging .o phy.ogenic division , and four belonging to phylogen.c d.vs.on ,1 
Deionging K J e „j o K Selander 1988. Evolutionary 

(Musser. J.M.. 3.S. KroU. E.R. Moxon, and R.K. Selander. 
genetics of the encapsulated strains of H<.e.op.«„. >.flu,n.a. Proc. Nati. A^. 

S A 85:7758.77«.,. Chronrosontal DK A was digested with B.ni an then 
p„.dwiththeintragenic3.3Kbfragn.en.ofthe...gene. ^-^^^^^ 
P3n.0s.rainsshowedhybridi^tion.Theuniversalpresencean,ongM,./I— 
Jpe b raised the question of the prevalence of this locus in other non-t>P= b 
encapsulated K >.fluen^. Southern analysis of a series of type a. c. d. e. 
isolates again demonstrated a homolog in all cases (Figure 13). 

Rece„t,yF,eisch„.annetaUFleischn,annR.D..eta,..1995.^^.,o,e-g^^^^^^^^^ 

scuencingand assembly Haen,opH>,us ,r.flue.-.e Rd. Scence. 269. 4 6ol..) 
rp^l*egenomese,uenceofH.M.e.-«strainRd.wbichwasoneo the wo 

i:pedlins examined by southern analysis. — ^ ^""'^7 
:ILsui.s.sea.hof,heRdgenomere..ealeda„openreading^ew,,hs..n 

Zncesimilaritv.o/«..TheRdgeneis894nuc,eo.idesinle„gthand,spred.c«d 

rZdeaprote;nof298 amino ac.ds.Overa„.theRd.ocus.s70..den.,ca, to 
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the C54 HA2 gene, and the Rd derived amino acid sequence is 62% identical and 
75% similar to C54 HA2. Interestingly, the Rd open reading frame appears to be 
truncated due to a "premature" stop codon. 

Previous experiments revealed that 13 of 15 nontypable strains lacking an 
5 HMW l/HMW2-relatedprotein had evidence of an /i4 7 homolog(Barenkamp,SJ., 

and J.W. St. Geme, III. Identification of a second family of high molecular weight 
adhesion proteins expressed by nontypable Haemophilus influenzae, Mol. MicrobioL, 
in press.). Consistent with the demonstration that HA2 and HA J are homologous. 
Southern analysis of these 15 strains, probing with the 3.3 kb fragment of hsf. 
10 demonstrated hybridization in 12 of the same 13 (not shovra). 

Chromosomal location of the HA2 locus 

In earlier work, the HA I locus in nontypable strain 1 1 was found to be flanked 
upstream by an open reading frame with significant homology to coli 
exoribonuclease II (Barenkamp. S J., and J.W. St. Geme. III. Identification of a second 

15 family of high molecular weight adhesion proteins expressed by nontypable 

Haemophilus influenzae. Mol. Microbiol., in press.). Similarly, the HA2 locus in 
strain C54 likewise is flanked on the 5* side by an open reading frame with similarit\ 
to E. coli exonuclease II. This gene terminates 357 base pairs before the HA2 start 
codon and encodes a protein with a predicted amino acid sequence that is 61 % similar 

20 and 33% identical at its C-ierminal end to exoribonuclease II. Of note, the Rd HA2 

homolog is also flanked upstream by the exoribonuclease II locus. 

EXAMPLE 3 
Cloning of HA3 

Recombinant phage containing the nontN^pable Haemophilus su-ain 32 HA3 gene were 
25 isolated and characterized using methods modified slightly from those described 
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previously (Barenkampand St. Geme. Molecular Microbiology 1996, in press). In 
brief chromosomalDNAfromstrain32waspreparedbyamodificationofthe method 

of Marmur (Marmur. 1961). Sau3A partial restriction digests of the DNA were 
preparedfractionatedonO.To/oagarosegels. Fractions containing DNA fragments 
in the 9- to 20- kbp range were pooled, and a library was prepared by ligation mto 
XEMBL3 arms. Ligation mixtures were packaged in vitro with Gigapack® 
(Stratagene, La Jolla. CA) and plate amplified in a P2 lysogen of E. coli LE392. 

Lambda plaque screening was performed using a mixture of three PGR products 
derived from strain 32 chromosomal DNA. These PGR products were amplified usmg 
primer pairs previously shown to amplify DNA segments at the 5' end of the stram 
1 1 HAl gene. The primers were as follows: 




Each of the positive strand primers was 
lo generate the three fragments used for probing the librarv' 



used with the single negative strand primer 



1 1 and strain 32 chromosomal DNA were 
eotide sequences of these chromosomal 
Plaque screening was performed using 



25 



The PGR products generated from stram 

iden.ica. in si^=. suggesing *a, .he nuclcmid. sequences of *ese chromoson,al 
regions were similar in the two strains, 
s^dard methodology (Berger and Kimmel. 1987, at high stringency, f.nal wash 
conditionswere 65C for 1 hour in buffer conuining2XSSC and 1% SDS. Positi« 
plaques were identified by autoradiography, plaque purified and phage DNA was 
purifiedbv standardmethods. The same primer pairs used ,o generatethe screen.ng 
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probes were then used to localize the HAS gene by amplifying various restriction 
fragments derived from the phage DNA. Once localized, the strain 32 HA3 gene 
and flanking DNA were sequenced using standard methods. 

In order to construct strain 32 isogenic Haemophilus influenzae mutants deficient 
in expression of the HA3 gene, bacteria were made competent using the MI V (Hemott 
et al. 1 970) and were transformed with linearized pHMW8-6, selecting for kanamycin 
resistance. Allelic exchange was confirmed by Southern analysis. The mutants that 
no longer expressed HA3 exhibited a marked decrease in binding to Chang epithelial 
cells, using the methods outlined above (data not shown). 

Expression in non-adherent strains of £. coli did not result in adherence, although 
it has not been confirmed that the protein was actually expressed. 
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(A) APPLICATION NUMBER: UNKNOWN 

(B) FILING DATE: 22-MAR-1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/403, 

(B) FILING DATE: 24-MAR-1995 

(A) TELEPHONE: (415) 781 x»o 
1b1 telefax: (415) 398-3249 
(C) TELEX: 910 277299 

(2) INFORMATION FOR SEQ ID NO : 1 : 

,i) SEQUENCE CHARACTERISTICS: 
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ATGAACAAAA 


TTTTTAACGT 


TATTTGGAAT 


GTTGTGACTC 


AAACTTGGGT 


TGTCGTATCT 


60 


GAACTCACTC 


GCACCCACAC 


CAAATGCGCC 


TCCGCCACCG 


TGGCGGTTGC 


CGTATTGGCA 


120 


ACCCTGTTGT 


CCGCAACGGT 


TGAGGCGAAC 


AACAATACTC 


CTGTTACGAA 


TAAGTTGAAG 


180 


GCTTATGGCG 


ATGCGAATTT 


TAATTTCACT 


AATAATTCGA 


TAGCAGATGC 


AGAAAAACAA 


240 


GTTCAAGAGG 


CTTATAAAGG 


TTTATTAAAT 


CTAAATGAAA 


AAAATGCGAG 


TGATAAACTG 


300 


TTGGTGGAGG 


ACAATACTGC 


GGCGACCGTA 


GGCAATTTGC 


GTAAATTGGG 


CTGGGTATTG 


360 


TCTAGCAAAA 


ACGGCACAAG 


GAACGAGAAA 


AGCCAACAAG 


TCAAACATGC 


GGATGAAGTG 


420 


TTGTTTGAAG 


GCAAAGGCGG 


TGTGCAGGTT 


ACTTCCACCT 


CTGAAAACGG 


CAAACACACC 


480 


ATTACCTTTG 


CTTTAGCGAA 


AGACCTTGGT 


GTGAAAACTG 


CGACTGTGAG 


TGATACCTTA 


540 


ACGATTGGCG 


GTGGTGCTGC 


TGCAGGTGCT 


ACAACAACAC 


CGAAAGTGAA 


TGTAACTAGT 


600 


ACAACTGATG 


GCTTGAAGTT 


CGCTAAAGAT 


GCTGCGGGTG 


CTAATGGCGA 


TACTACGGTT 


660 


CACTTGAATG 


GTATTGGTTC 


AACCTTGACA 


GACACGCTTG 


TGGGTTCTCC 


TGCTACTCAT 


720 


ATTGACGGAG 


GAGATCAAAG 


TACGCATTAC 


ACTCGTGCAG 


CAAGTATCAA 


GGATGTCTTG 


780 


AATGCGGGTT 


GGAATATCAA 


GGGTGTTAAA 


GCTGGCTCAA 


CAACTGGTCA 


ATCAGAAAAT 


840 


GTCGATTTTG 


TTCATACTTA 


CGATACTGTT 


GAGTTCTTGA 


GTGCGGATAC 


AGAGACCACG 


900 


ACTGTTACTG 


TAGATAGCAA 


AGAAAACGGT 


AAGAGAACCG 


AAGTTAAAAT 


CGGTGCGAAG 


960 


ACTTCTGTTA 


TCAAAGAAAA 


AGACGGTAAG 


TTATTTACTG 


GAAAAGCTAA 


CAAAGAGACA 


1020 


AATAAAGTTG 


ATGGTGCTAA 


CGCGACTGAA 


GATGCAGACG 


AAGGCAAAGG 


CTTAGTGACT 


1080 


GCGAAAGATG 


TGATTGACGC 


AGTGAATAAG 


ACTGGTTGGA 


GAATTAAAAC 


AACCGATGCT 


1140 


AATGGTCAAA 


ATGGCGACTT 


CGCAACTGTT 


GCATCAGGCA 


CAAATGTAAC 


CTTTGCTAGT 


1200 


GGTAATGGTA 


CAACTGCGAC 


TGTAACTAAT 


GGCACCGATG 


GTATTACCGT 


TAAGTATGAT 


1260 


GCGAAAGTTG 


GCGACGGCTT 


AAAACTAGAT 


GGCGATAAAA 


TCGCTGCAGA 


TACGACCGCA 


1320 


CTTACTGTGA 


ATGATGGTAA 


GAACGCTAAT 


AATCCGAAAG 


GTAAAGTGGC 


TGATGTTGCT 


1380 




AGAAGAAATT 


GGTTACAGCA 


AAAGGTTTAG 


TAACAGCCTT 


AAACAGTCTA 


1440 


AGCTGGACTA 


CAACTGCTGC 


TGAGGCGGAC 


GGTGGTACGC 


TTGATGGAAA 


TGCAAGTGAG 


1500 


CAAGAAGTTA 


AAGCGGGCGA 


TAAAGTAACC 


TTTAAAGCAG 


GCAAGAACTT 


AAAAGTGAAA 


1560 


CAAGAGGGTG 


CGAACTTTAC 


TTATTCACTG 


CAAGATGCTT 


TAACAGGCTT 


AACGAGCATT 


1620 


ACTTTAGGTA 


CAGGAAATAA 


TGGTGCGAAA 


ACTGAAATCA 


ACAAAGACGG 


CTTAACCATC 


1680 
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»CACCAGC« ATOOTCCGCG TGC«»T«T GC«A«CCA TC*«»T«C CAAAGACGGC 
ATTAGTGCGO GCGGTCAGTC G=TT»«AC GTTGTGAOCO GACTGAAGAA ATTTGGTGAT 
GCGAATTTCG ATCCGCTGAC TAGCTCCGCC GACAACTT» CGA«C«« TGAGGATGCC 
TAT»AGGCT TGACGAATTT GGATGA«» GGTACAGACA AGCA^CTCC AGTTGTTGCC 
GACAATACCG CCGCAACCGT GGGCGATTTO CGCGGCTTGG GCTGGGTGAT TTCTGCGGAC 
AAAACCACAG GCGGCTCAAC GGAATATCAC GATCWGTTC GGAATGCGAA CGAAGTGAM 
TTCAAAAGCG GGAACGGTAT CAATGTTTCC GGTAMAGGG TCAACGGTAG GCGTGAAATT 
ACTTTTGAAT TGGCTAAAGG TGAAGTGGTT A«TCGAATG AATTTACCGT CAAAGAAACC 
AAT«3AAAGG AAA=GAGC=:T GGTTAAAGTT GGCGATAAAT ATTACAGCAA AGAGGATATT 
GACTTAACAA GAGGTCAGCC TAAATTAAAA GATGGCAATA CAGTTGCTGC GAAATATCAA 
GATAAAGGTG GCAAAGTCGT TTCTGTAACG GATAATACTG AAGCTACCAT AACCAACAAA 
GGTTCTGGCT ATGTAACAGG TAAGCAAGTG GCAGATGCGA TTGCGAAATC AGGCTTTGAG 
CTTGGCTTGG CTGATGAAGC TGATGCGAAA CGGGCGTTTG ATGATAAGAC AAAAGCCTTA 
TCTGCTGGTA CAACGGAAAT TGTAAATGCC CACGATAAAG TCCGTTTTGC TAATGGTTTA 
AATACCAAAO TGAGCOCGGC AACGGTGGAA AGCACCGATG GAAACGGCGA TAAAGTGACC 

ACAACcrrrro tgaaaaccga tgtggaattg ccrrrAAcGC aaatgtacaa taccgatgga 

«GGGTAAGA AAATCACTAA AGTTGTGAAA GATGGGCAAA CTAAATGGTA TGAACTGAAT 
GCTGACGGTA CGGCTGATAT GAGCAAAGAA GTTACCCTCG GTAACGT<=GA TTCAGACGGC 
AAGAAAGTTG TGAAAGACAA GGATGGCAAG TGGTATCACG CCAAAGCTGA CGGTACTGCG 
GATAAAACCA AAGGGGAAGT GAGCAATGAT AAAGTTTCTA GCGATGAAAA ACAGGTTGTC 
AGCCTTGATC CAAATGATCA ATCAAAAGGT AAAGGT^TCG TGATTGACAA TGTGGGTAAT 
GGCGATATTT CTGCCACTTC CACCGATGCG ATTAACGGAA GTGAGTTGTA TGCTGTGGCA 
;^GGTAA CAAACCTTGG TGGACAAGTG AATAATGTTG AGGGCAAAGT GAATAAAGTG 
GGCAAACGTG GAGATGCAGG TACAGCAAGT GGATTAGCGG GTTCACAG.T ACCACAAGCC 
ACTATGCCAG GTAAATCAAT GGTTGCTATT GCGGGAAGTA GTTATCAAGG TCAAAATGGT 
TTAGCTATCG GGGTATCAAG AATTTCCGAT AATGGCAAAG TGATTATTCG CTTGTCAGGC 
AGAACCAATA GTCAAGGTAA AACAGGCGTT GCAGCAGGTG TTGGTTACCA GTGG 



1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2B80 

2940 

3000 

3060 

3120 

3180 

3240 

3294 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNE S S : un)cno wn 

( D ) TOPOLOGY : iinknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

' Met Asn Lys He Phe Asn Val He Trp Asn Val Val Thr Gin Thr Trp 
1 5 10 15 

Val Val Val Ser Glu Leu Thr Arg Thr His Thr Lys Cys Ala Ser Ala 
20 25 30 

Thr Val Ala Val Ala Val Leu Ala Thr Leu Leu Ser Ala Thr Val Glu 
35 40 45 

Ala Asn Asn Asn Thr Pro Val Thr Asn Lys Leu Lys Ala Tyr Gly Asp 
50 55 60 

Ala Asn Phe Asn Phe Thr Asn Asn Ser He Ala Asp Ala Glu Lys Gin 
65 70 75 BO 

Val Gin Glu Ala Tyr Lys Gly Leu Leu Asn Leu Asn Glu Lys Asn Ala 
85 90 95 

Ser Asp Lys Leu Leu Val Glu Asp Asn Thr Ala Ala Thr Val Gly Asn 
100 105 110 

Leu Arg Lys Leu Gly Trp Val Leu Ser Ser Lys Asn Gly Thr Arg Asn 
115 120 125 

Glu Lys Ser Gin Gin Val Lys His Ala Asp Glu Val Leu Phe Glu Gly 
130 135 140 

Lvs Gly Gly Val Gin Val Thr Ser Thr Ser Glu Asn Gly Lys His Thr 
145 150 155 160 

He Thr Phe Ala Leu Ala Lys Asp Leu Gly Val Lys Thr Ala Thr Val 
165 170 175 

Ser Asp Thr Leu Thr He Gly Gly Gly Ala Ala Ala Gly Ala Thr Thr 
180 185 190 

Thr Pro Lys Val Asn Val Thr Ser Thr Thr Asp Gly Leu Lys Phe Ala 
195 200 205 

'lys Asp Ala Ala Gly Ala Asn Gly Asp Thr Thr Val His Leu Asn Gly 
210 215 220 

He Gly Ser Thr Leu Thr Asp Thr Leu Val Gly Ser Pro Ala Thr His 
225 230 235 240 
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II. ASP Giy =ly -=1" "s 

«p v,l l^u «n .la Cly Trp «n xle .ys oly v.l .ys «a oly 

260 265 

s.. T« Th. Oly .in ser =lu v,l ASP P.. val His Ty. «P 

275 



Thr- Glu Thr Thr Thr Val Thr Val 
Thr val Glu Phe Leu Ser Ala Asp Thr Glu Tnr 

290 

«p s,r .ys «V -vs A.9 T>,r =lu val .ys .le oly ^a .ys 

310 

rr se, val 11. -ys Olu .ys "P «V .ya "^^^ 

325 "^"^^ 

.ya Olu Aa„ .ys Val «P =ly "a As. .1. Th, olu «p Ma 

340 

„p alu Oly .ya oly l^u val TK. Ma .ya val U. «P M- val 
.y, tL Oly Trp .r, Ila .ya T« T« Aap Ma «n Oly oln A=» 

Oly Tp .ha Ma Th, val Ma Se. Oly Thr val Thr Pha Ma Sar 

390 

liy .s„ Oly Thr Thr M. Thr Val Thr Oly Thr «P =ly 11a Thr 

T » ^7=^ Glv AST> Gly Leu Lys Leu Asp Gly Asp 
val Lys Tyr Asp Ala Lys Val Gly Asp eiy 
420 *25 

11. Ma Ma «P Thr Thr M. ..u Thr val «n «p Oly .y. -h 
435 

zi o Pro Lvs Gly Lys val Ala Asp Val Ala Ser Thr Asp Glu 
Ala Asn Asn Pro Lys L.iy i-y= 

450 '^^^ 



. H-v,, una LVS Gly Leu Val Thr Ala Leu Asn Ser Leu 
Val Thr Ala ijys ^xy ago 

470 '^^^ 
S.r Trp Thr Thr Thr Ma Ma Olu Ma ^^P Oly Oly Thr I.U Aap Oly 



Lvs Lys Leu 

470 

465 



485 



.sn Ala ser Glu Gin Glu Val Lys Ala Gly Asp Lys Val Thr Phe Lys 



500 



. T.u LVS val Lys Gin Glu Gly Ala Asn Phe Thr Tyr 
Ala Gly Lys Asn Leu Lys vai ^y ^^5 
515 

Leu Thr Gly Leu Thr Ser He Thr Leu Gly Thr 
ser Leu Gin Asp Ala Leu Thr >iy ^ 

530 
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Gly Asn Asn Gly Ala Lys Thr Glu lie Asn Lys Asp Gly Leu Thr lie 
545 550 555 560 

Thr Pro Ala Asn Gly Ala Gly Ala Asn Asn Ala Asn Thr lie Ser Val 
565 570 575 

Thr Lys Asp Gly lie Ser Ala Gly Gly Gin Ser Val Lys Asn Val Val 
580 585 590 

Ser Gly Leu Lys Lys Phe Gly Asp Ala Asn Phe Asp Pro Leu Thr Ser 
595 600 605 

Ser Ala Asp Asn Leu Thr Lys Gin Asn Asp Asp Ala Tyr Lys Gly Leu 
610 615 620 

Thr Asn Leu Asp Glu Lys Gly Thr Asp Lys Gin Thr Pro Val Val Ala 
625 630 635 640 

Asp Asn Thr Ala Ala Thr Val Gly Asp Leu Arg Gly Leu Gly Trp Val 
645 650 655 

lie Ser Ala Asp Lys Thr Thr Gly Gly Ser Thr Glu Tyr His Asp Gin 
660 665 670 

Val Arg Asn Ala Asn Glu Val Lys Phe Lys Ser Gly Asn Gly lie Asn 
675 680 685 

Val Ser Gly Lys Thr Val Asn Gly Arg Arg Glu lie Thr Phe Glu Leu 
690 695 700 

Ala Lys Gly Glu Val Val Lys Ser Asn Glu Phe Thr Val Lys Glu Thr 
705 710 715 720 

Asn Gly Lys Glu Thr Ser Leu Val Lys Val Gly Asp Lys Tyr Tyr Ser 
725 730 735 

Lys Glu Asp lie Asp Leu Thr Thr Gly Gin Pro Lys Leu Lys Asp Gly 
740 745 750 

Asn Thr Val Ala Ala Lys Tyr Gin Asp Lys Gly Gly Lys Val Val Ser 
755 760 765 

Val Thr Asp Asn Thr Glu Ala Thr lie Thr Asn Lys Gly Ser Gly Tyr 
770 775 780 

Val Thr Gly Asn Gin Val Ala Asp Ala He Ala Lys Ser Gly Phe Glu 
785 790 795 800 

Leu Gly Leu Ala Asp Glu Ala Asp Ala Lys Arg Ala Phe Asp Asp Lys 
805 810 815 

Thr Lys Ala Leu Ser Ala Gly Thr Thr Glu He Val Asn Ala His Asp 
820 825 830 

Lys Val Arg Phe Ala Asn Gly Leu Asn Thr Lys Val Ser Ala Ala Thr 
835 840 845 
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Val Glu Ser Thr Asp 
850 



Ala Asn Gly Asp Lys Val Thr Thr Thr Phe Val 



855 



860 



Lys Thr Asp Val Glu Leu Pro 



Leu Thr Gin lie Tyr Asn Thr Asp Ala 



870 



865 

Asn Oly .ys I-ys II. Thr L.ys val v.l Lys Asp 31y Gin Thr Lys Trp 

885 

Tyr Glu Leu Asn Ala Asp Gly Thr Ala Asp Met Thr Lys Glu Val Thr 



875 



880 



900 



l.eu Gly Asn Val Asp Ser Asp Gly Lys I.ys Val Val Lys Asp Asn Asp 

920 



915 

Gly Lys Trp Tyr His Ala Lys Ala Asp Gly Thr Ala Asp X.ys Thr Lys 

930 

Gly Glu val ser Asn Asp Lys Val Ser Thr Asp Glu Lys His Val Val 



945 



950 



955 



Tv^^ c;-*,^ Lvs Glv Lys Gly Val Val He Asp 

Ser Leu Asp Pro Asn Asp Gin Ser Lys ^xy y 

965 

Asn val Ala Asn Gly Asp He Ser Ala Thr Ser Thr Asp Ala He Asn 

980 

Gly ser Gin Leu Tyr Ala Val Ala Lys Gly Val Thr Asn^Leu Ala Gly 
995 1000 

Gin val Asn Asn Leu Glu Gly Lys Val Asn Lys Val Gly Lys Arg Ala 
1010 1015 1020 

;.s. Ala Gly Thr Ala Ser Ala Leu Ala Ala Ser Gin Leu Pro Gin Ala^ 
1025 1030 1035 

Thr Mec pro Gly Lys Ser Met Val Ala lie Ala Gly Ser Ser Tyr^Gln 
1045 ^^^^ 



OIV Olh «. Gly Ala Ila Gly val ser Arg He Ser Asp Gly 

1060 1°^^ 
I,ys val He lie Arg Leu Ser Gly Thr Thr Asn Ser Gln^Gly Lys Thr 
1075 

Gly val Ala Ala Gly Val Gly Tyr Gin Trp 
1090 ^0^5 . 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7291 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 163.. 7221 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TTT N TT TT TC TTATTTTTTT TTTTTTTTTT TTTTTTTTTT TTGAGGCTAA ACTTTTNGNA 60 

AAATATCACT TTTTTATTCT CCAAATATAG AATAGAATAC GCACGATTTC ACTAAGAAAA 120 

GTATATTTAT CATTAATTTT ATTAAATATA AGGTAAATAA AA ATG AAC AAA ATT 174 

Met Asn Lys lie 
1 

TTT AAC GTT ATT TGG AAT GTT ATG ACT CAA ACT TGG GTT GTC GTA TCT 222 
Phe Asn Val He Trp Asn Val Met Thr Gin Thr Trp Val Val Val Ser 
5 10 15 20 



GAA CTC ACT CGC ACC CAC ACC AAA CGC GCC TCC GCA ACC GTG GAG ACC 
Glu Leu Thr Arg Thr His Thr Lys Arg Ala Ser Ala Thr Val Glu Thr 
25 30 35 



270 



GCC GTA TTG GCG ACA CTG TTG TTT GCA ACG GTT CAG GCG AAT GCT ACC 318 
Ala Val Leu Ala Thr Leu Leu Phe Ala Thr Val Gin Ala Asn Ala Thr 
40 45 50 

GAT GAA GAT GAA GAG TTA GAC CCC GTA GTA CGC ACT GCT CCC GTG TTG 3 66 

Asp Glu Asp Glu Glu Leu Asp Pro Val Val Arg Thr Ala Pro Val Leu 
55 60 65 

AGC TTC CAT TCC GAT AAA GAA GGC ACG GGA GAA AAA GAA GTT ACA GAA 414 
Ser Phe His Ser Asp Lys Glu Gly Thr Gly Glu Lys Glu Val Thr Glu 
70 75 80 

AAT TCA AAT TGG GGA ATA TAT TTC GAC AAT AAA GGA GTA CTA AAA GCC 46 2 

Asn Ser Asn Trp Gly He Tyr Phe Asp Asn Lys Gly Val Leu Lys Ala 
85 90 95 100 

GGA GCA ATC ACC CTC AAA GCC GGC GAC AAC CTG AAA ATC AAA CAA AAC 510 
Gly Ala He Thr Leu Lys Ala Gly Asp Asn Leu Lys He Lys Gin Asn 
105 HO 115 

ACC GAT GAA AGC ACC AAT GCC AGT AGC TTC ACC TAC TCG CTG AAA AAA 55 8 

Thr Asp Glu Ser Thr Asn Ala Ser Ser Phe Thr Tyr Ser Leu Lys Lys 
120 125 130 

GAC CTC ACA GAT CTG ACC AGT GTT GCA ACT GAA AAA TTA TCG TTT GGC 606 
Asp Leu Thr Asp Leu Thr Ser Val Ala Thr Glu Lys Leu Ser Phe Gly 
135 140 145 

GCA AAC GGC GAT AAA GTT GAT ATT ACC AGT GAT GCA AAT GGC TTG AAA 6 54 

Ala Asn Gly Asp Lys Val Asp He Thr Ser Asp Ala Asn Gly Leu Lys 
150 155 160 
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TTG GCG AAA ACA GGT AAC GGA AAT GTT CAT TTG AAT GGT TTG GAT TCA 
Leu Ala Lys Thr Gly Asn Gly Asn Val His Leu Asn Gly Leu Asp Ser 
165 170 l-'S 180 

ACT TTG CCT GAT GCG GTA ACG AAT ACA GGT GTG TTA AGT TCA TCA AGT 
Thr Leu pro Asp Ala Val Thr Asn Thr Gly Val Leu Ser Ser Ser Ser 
185 19S 

TTT ACA CCT AAT GAT GTT GAA AAA ACA AGA GCT GCA ACT GTT AAA GAT 
pSI Thr Pro Asn Asp Val Glu Lys Thr Arg Ala Ala Thr Val Lys Asp 
200 205 210 

GTT TTA AAT GCA GGT TGG AAC ATT AAA GGT GCT AAA ACT GCT GGA GGT 
vll leu Ala Gly Trp Asn He Lys Gly Ala Lys Thr Ala Gly Gly 

215 220 225 

AAT GTT GAG AGT GTT GAT TTA GTG TCC GCT TAT AAT AAT GTT GAA TTT 
Asn val Glu Ser Val Asp Leu Val Ser Ala Tyr Asn Asn Val Glu Phe 
230 235 240 

ATT ACA GGC GAT AAA AAC ACG CTT GAT GTT GTA TTA ACA GCT AAA GAA 
He Thr Gly Asp Lys Asn Thr Leu Asp Val Val Leu Thr Ala Lys Glu 
245 250 255 260 

AAC GGT AAA ACA ACC GAA GTG AAA TTC ACA CCG AAA ACC TCT GTT ATC 
Asn Gly Lys Thr Thr Glu Val Lys Phe Thr Pro Lys Thr Ser Val He 
265 270 275 



AAA GAA AAA GAC GGT AAG TTA TTT ACT GGA AAA GAG AAT AAC GAC ACA 
Lys Glu Lys ASP Gly Lys Leu Phe Thr Gly Lys Glu Asn Asn Asp Thr 
280 285 290 

AAT AAA GTT ACA AGT AAC ACG GCG ACT GAT AAT ACA GAT GAG GGT AAT 
Asn Lys val Thr Ser Asn Thr Ala Thr Asp Asn Thr Asp Glu Gly Asn 
295 300 

GGC TTA GTC ACT GCA AAA GCT GTG ATT GAT GCT GTG AAC AAG GCT GGT 
Gly Leu val Thr Ala Lys Ala val He Asp Ala Val Asn Lys Ala Gly 
310 315 ^20 

TGG AGA GTT AAA ACA ACT ACT GCT AAT GGT CAA AAT GGC GAC TTC GCA 
Trp Arg Val Lys Thr Thr Thr Ala Asn Gly Gin Asn Gly Asp Phe Ala 
325 330 335 340 

ACT GTT GCG TCA GGC ACA AAT GTA ACC TTT GAA AGT GGC GAT GGT ACA 
rZ vll Ala ser Gly Thr Asn Val Thr Phe Glu Ser Gly Asp Gly Thr 
345 350 355 

APA GCG TCA GTA ACT AAA GAT ACT AAC GGC AAT GGC ATC ACT GTT AAG 
?hr Sa S^^ ?hr Lys Asp Thr Asn Gly Asn Gly He Thr Val Lys 

360 365 370 

TAC GAC GCG AAA GTT GGC GAC GGC TTG AAA TTT GAT AGC GAT AAA AAA 
^yr 2p Ala Lys Val Gly Asp Gly Leu Lys Phe Asp Ser Asp Lys Lys 

ion 38b 

375 



702 



750 



798 



646 



894 



942 



990 



10 3 8 



1086 



1134 



1182 



1230 



1278 



1326 
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ATC GTT GCA GAT ACG ACC GCA CTT ACT GTG ACA GGT GGT AAG GTA GCT 1374 
lie Val Ala Asp Thr Thr Ala Leu Thr Val Thr Gly Gly Lys Val Ala 
390 395 400 

GAA ATT GCT AAA GAA GAT GAC AAG AAA AAA CTT GTT AAT GCA GGC GAT 1422 
Glu lie Ala Lys Glu Asp Asp Lys Lys Lys Leu Val Asn Ala Gly Asp 
405 410 415 420 

TTG GTA ACA GCT TTA GGT AAT CTA AGT TGG AAA GCA AAA GCT GAG GCT 14 70 

Leu Val Thr Ala Leu Gly Asn Leu Ser Trp Lys Ala Lys Ala Glu Ala 
425 430 435 

GAT ACT GAT GGT GCG CTT GAG GGG ATT TCA AAA GAC CAA GAA GTC AAA 1518 
Asp Thr Asp Gly Ala Leu Glu Gly lie Ser Lys Asp Gin Glu Val Lys 
440 445 450 

GCA GGC GAA ACG GTA ACC TTT AAA GCG GGC AAG AAC TTA AAA GTG AAA 1566 
Ala Gly Glu Thr Val Thr Phe Lys Ala Gly Lys Asn Leu Lys Val Lys 
455 460 465 

CAG GAT GGT GCG AAC TTT ACT TAT TCA CTG CAA GAT GCT TTA ACG GGT 1614 
Gin Asp Gly Ala Asn Phe Thr Tyr Ser Leu Gin Asp Ala Leu Thr Gly 
470 475 480 

TTA ACG AGC ATT ACT TTA GGT GGT ACA ACT AAT GGC GGA AAT GAT GCG 1662 
Leu Thr Ser lie Thr Leu Gly Gly Thr Thr Asn Gly Gly Asn Asp Ala 
485 490 495 500 

AAA ACC GTC ATC AAC AAA GAC GGT TTA ACC ATC ACG CCA GCA GGT AAT 1710 
Lys Thr Val lie Asn Lys Asp Gly Leu Thr lie Thr Pro Ala Gly Asn 
505 510 515 

GGC GGT ACG ACA GGT ACA AAC ACC ATC AGC GTA ACC AAA GAT GGC ATT 1758 
Gly Gly Thr Thr Gly Thr Asn Thr lie Ser Val Thr Lys Asp Gly lie 
520 525 530 

AAA GCA GGT AAT AAA GCT ATT ACT AAT GTT GCG AGT GGT TTA AGA GCT 1806 
Lys Ala Gly Asn Lys Ala lie Thr Asn Val Ala Ser Gly Leu Arg Ala 
535 540 545 

TAT GAC GAT GCG AAT TTT GAT GTT TTA AAT AAC TCT GCA ACT GAT TTA 18 54 

Tyr Asp Asp Ala Asn Phe Asp Val Leu Asn Asn Ser Ala Thr Asp Leu 

550 555 560 

AAT AGA CAC GTT GAA GAT GCT TAT AAA GGT TTA TTA AAT CTA AAT GAA 1902 
Asn Arg His Val Glu Asp Ala Tyr Lys Gly Leu Leu Asn Leu Asn Glu 
565 570 575 580 

AAA AAT GCA AAT AAA CAA CCG TTG GTG ACT GAC AGC ACG GCG GCG ACT 1950 
Lys Asn Ala Asn Lys Gin Pro Leu" Val Thr Asp Ser Thr Ala Ala Thr 
585 590 595 

GTA GGC GAT TTA CGT AAA TTG GGT TGG GTA GTA TCA ACC AAA AAC GGT 1998 
Val Gly Asp Leu Arg Lys Leu Gly Trp Val Val Ser Thr Lys Asn Gly 
600 605 610 
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. r^.^ ^rr AAT CAA GTT AAA CAA GCT GAT GAA GTC CTC TTT 

!^^,T.Tu vax .ys «. «u va. PH. 

615 

z s s ?s ?s s s X z 
St: i tii ?s ^ s J vi? s ?s ^ s: 
zi z z S t- ^ ^ ^i' zi 
z -I i s: ^» z ^" z zi s ^ - 

s X T. ?s zz^^ 

695 . 

,.nT rrT AAT GAC GCT GAT AAG AAA GTC GCA 

^ Z =^ ... ^y. v.. 

710 '^^^ 

^^r^ r-na iiTT AAT AGT GCG GCG ACT TTT GTG 
ACT GTA AAA GAT GTT GCA ACC GCA ATT AAT AGT 
Thr val Lys Asp Val Ala Thr Aia J.xe 
725 -^30 

ar-r TCT ATT GAT GAA GAT AAT CCT ACA GAT 
AAA ACA GAG AAT TTA ACT ACC TCT ATT GAT 
Lys Thr Glu Asn Leu Thr Thr Ser He Asp Glu p 
745 ^ 

r^^n rrA CTT AAA GCG GGC GAT ACC TTA ACC TTT AAA 
AAC GGC AAA GAT GAC GCA CTT ^ ^ ^ Leu Thr Phe Lys 

Asn Gly Lys Asp Asp Ala Leu Lys Ala Gly Asp Thr 
760 

GTT AAA CGT GAT GGA AAA AAT ATT ACT TTT 
GCA GGT AAA AAC CTG AAA GTT AAA CG 
Ala Gly Lys Asn Leu Lys Val Lys Arg a p 
775 

^ r^Tvr- rTG AAA ACT GCG AAA GTG AGT GAT ACT 
r Sa ^ S ^ ^vs val «p Thr 

Asp Leu Ala Lys Asn 

790 ^ 

^rzi rCT ACA GGT GGC ACT ACT GCG ACG CCA 
TTA ACG ATT GGC GGG AAT ACA CCT ACA G 
Leu Thr He Gly Gly Asn Thr Pro Thr Gly y 
805 

.or^ nrr GCT GAT GGT TTG AAT TTT GCA AAA GAA 
AAA GTG AAT ATT ACT AGC ACG GCT GAT G 
Lys val Asn He Thr Ser Thr Ala Asp Gly 



2046 



2094 



2142 



2190 



2238 



2286 



2334 



2382 



2430 



2478 



2526 



2574 



2622 



2670 
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ACA GCC GAT GCC TCG GGT TCT AAG AAT GTT TAT TTG AAA GGT ATT GCG 2718 
Thr Ala Asp Ala Ser Gly Ser Lys Asn Val Tyr Leu Lys Gly lie Ala 
840 845 850 

ACA ACT TTA ACT GAG CCA AGC GCG GGA GCG AAG TCT TCA CAC GTT GAT 2766 
Thr Thr Leu Thr Glu Pro Ser Ala Gly Ala Lys Ser Ser His Val Asp 
855 860 865 

TTA AAT GTG GAT GCG ACG AAA AAA TCC AAT GCA GCA AGT ATT GAA GAT 2814 
Leu Asn Val Asp Ala Thr Lys Lys Ser Asn Ala Ala Ser lie Glu Asp 
870 875 880 

GTA TTG CGC GCA GGT TGG AAT ATT CAA GGT AAT GGT AAT AAT GTT GAT 2862 
Val Leu Arg Ala Gly Trp Asn lie Gin Gly Asn Gly Asn Asn Val Asp 
885 890 895 900 

TAT GTA GCG ACG TAT GAC ACA GTA AAC TTT ACC GAT GAC AGC ACA GGT 2910 
Tyr Val Ala Thr Tyr Asp Thr Val Asn Phe Thr Asp Asp Ser Thr Gly 
905 910 915 

ACA ACA ACG GTA ACC GTA ACC CAA AAA GCA GAT GGC AAA GGT GCT GAC 2 958 

Thr Thr Thr Val Thr Val Thr Gin Lys Ala Asp Gly Lys Gly Ala Asp 
920 925 930 

GTT AAA ATC GGT GCG AAA ACT TCT GTT ATC AAA GAC CAC AAC GGC AAA 3006 
Val Lys lie Gly Ala Lys Thr Ser Val lie Lys Asp His Asn Gly Lys 
935 940 945 

CTG TTT ACA GGC AAA GAC CTG AAA GAT GCG AAT AAT GGT GCA ACC GTT 3054 
Leu Phe Thr Gly Lys Asp Leu Lys Asp Ala Asn Asn Gly Ala Thr Val 
950 955 960 

AGT GAA GAT GAT GGC AAA GAC ACC GGC ACA GGC TTA GTT ACT GCA AAA 3102 
Ser Glu Asp Asp Gly Lys Asp Thr Gly Thr Gly Leu Val Thr Ala Lys 
965 970 975 980 

ACT GTG ATT GAT GCA GTA AAT AAA AGC GGT TGG AGG GTA ACC GGT GAG 3150 
Thr Val lie Asp Ala Val Asn Lys Ser Gly Trp Arg Val Thr Gly Glu 
985 990 995 

GGC GCG ACT GCC GAA ACC GGT GCA ACC GCC GTG AAT GCG GGT AAC GCT 3198 
Gly Ala Thr Ala Glu Thr Gly Ala Thr Ala Val Asn Ala Gly Asn Ala 
1000 1005 1010 

GAA ACC GTT ACA TCA GGC ACG AGC GTG AAC TTC AAA AAC GGC AAT GCG 324 6 

Glu Thr Val Thr Ser Gly Thr Ser Val Asn Phe Lys Asn Gly Asn Ala 
1015 1020 1025 

ACC ACA GCG ACC GTA AGC AAA GAT AAT GGC AAC ATC AAT GTC AAA TAC 32 94 

Thr Thr Ala Thr Val Ser Lys Asp Asn Gly Asn lie Asn Val Lys Tyr 
1030 1035 1040 

GAT GTA AAT GTT GGT GAC GGC TTG AAG ATT GGC GAT GAC AAA AAA ATC 3 34 2 

Asp Val Asn Val Gly Asp Gly Leu Lys lie Gly Asp Asp Lys Lys lie 
1045 1050 1055 1060 
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^rr GTA ACA GGT GGT AAG GTG TCT GTT 

^ zi ^ r.? ..^c., ..v se.^v,. 

1065 

- cc. cc. - - - - - - - 511 

Pro Ala Gly Ala Asn Ser vai Asn ^^^^ 
1080 

Ajvf ARC CTA AGC TGG ACQ GCA AAA GCC 
G=T ™ =C. ACT OCT ™ «C CTA 

Glu Gly Leu Ala Ttir Aia ueu ^^^^ 

1095 

- - r s ti; z IS st^ it. ?s - - - 

ASP Lys Tyr Ala Asp eiy ^-^ ^^^20 
1110 

o-r. Arc TTT AAA GCA GGC AAG AAC TTA AAA GTG 

=S ^ - «. -V^^vs - - - 

1125 ^-^^^ 

^ „o Tc o«. - 12 s:; 2^ tS ?s 

Lys Gin Ser Glu Lys Asp Phe Thr Tyr ^ ^^^^ 
1145 

Tvr-a rrr AAT GGC AGA AAT GAT 
OGC TTA ACG AGC ATT ACT TTA GGT GGT ACA GCT AAT G^^ 

Gly Leu Thr Ser He Thr Leu Gly ^^f-. 1170 
1160 

.CG GGA ACC GTC ATC AAC AAA GAC GGC TTA .CC ATC ACG CTG GCA AAT 
Thr Gly Thr Val He Asn Lys Asp Gly Leu T ^^^^ 
1175 

n^T rCG TCT AAC GGA AAC ACC ATC AGT GTA 

Z S S S Z Si Zp - ... ™^ 

.r. =c= CO. ^ - .T. .cc .XT «o 

Thr Lys ASP Gly He Ser Ala Gly Asn Ly ^^20 
1205 ^210 

^.-r AAA GAT ACT CAA AAC ACT GCA GAT GAA ACA 
AGT GCT TTA AAA ACC TAT AAA GAT ACT CAA 

ser Ala Leu Lys Thr Tyr Lys Asp Thr ^ ^^35 
1225 

nrr TTT AAA AAC GCA AAT GAA GTT GAG 
CAA GAT AAA GAG TTC CAC GCC GCC GTT AAA 

Gin ASP Lys Glu Phe His Ala Ala vai^ y ^^^^ 
1240 

r.^r Arr CTG TCT GCA AAA ACT GAT AAC AAC 
„C GTC GOT »A «C GOT GCA ACC GTG TC^ ^„ 

Phe Val Gly Lys Asn Gly Ala in ^^^^ 

1255 

.nn GAT GTT GCA GAA GCC AAA GTT GGT GAT 

GGA AAA CAT ACT GTA ACG ATT GAT GTT 
Gly Lys His Thr Val Thr lie Asp ^^^^ 



3390 



3438 



3486 



3534 



3582 



3630 



3678 



3726 



3774 



3822 



3870 



3918 



3966 



4014 



1275 

1270 
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GGT CTT GAA AAA GAT ACT GAC GGC AAG ATT AAA CTC AAA GTA GAT AAT 4 062 

Gly Leu Glu Lys Asp Thr Asp Gly Lys He Lys Leu Lys Val Asp Asn 
1285 1290 1295 1300 

ACA GAT GGG AAT AAT CTA TTA ACC GTT GAT GCA ACA AAA GGT GCA TCC 4110 
Thr Asp Gly Asn Asn Leu Leu Thr Val Asp Ala Thr Lys Gly Ala Ser 
1305 1310 1315 

GTT GCC AAG GGC GAG TTT AAT GCC GTA ACA ACA GAT GCA ACT ACA GCC 4158 
Val Ala Lys Gly Glu Phe Asn Ala Val Thr Thr Asp Ala Thr Thr Ala 
1320 1325 1330 

CAA GGC ACA AAT GCC AAT GAG CGC GGT AAA GTG GTT GTC AAG GGT TCA 4206 
Gin Gly Thr Asn Ala Asn Glu Arg Gly Lys Val Val Val Lys Gly Ser 
1335 1340 1345 

AAT GGT GCA ACT GCT ACC GAA ACT GAC AAG AAA AAA GTG GCA ACT GTT 42 54 

Asn Gly Ala Thr Ala Thr Glu Thr Asp Lys Lys Lys Val Ala Thr Val 
1350 1355 1360 

GGC GAC GTT GCT AAA GCG ATT AAC GAC GCA GCA ACT TTC GTG AAA GTG 4 302 

Gly Asp Val Ala Lys Ala He Asn Asp Ala Ala Thr Phe Val Lys Val 
1365 1370 1375 1380 

GAA AAT GAC GAC AGT GCT ACG ATT GAT GAT AGC CCA ACA GAT GAT GGC 4 3 50 

Glu Asn Asp Asp Ser Ala Thr He Asp Asp Ser Pro Thr Asp Asp Gly 
1385 1390 1395 

GCA AAT GAT GCT CTC AAA GCA GGC GAC ACC TTG ACC TTA AAA GCG GGT 4 3 98 

Ala Asn Asp Ala Leu Lys Ala Gly Asp Thr Leu Thr Leu Lys Ala Gly 
1400 1405 1410 

AAA AAC TTA AAA GTT AAA CGT GAT GGT AAA AAT ATT ACT TTT GCC CTT 4446 
Lys Asn Leu Lys Val Lys Arg Asp Gly Lys Asn He Thr Phe Ala Leu 
1415 1420 1425 

GCG AAC GAC CTT AGT GTA AAA AGC GCA ACC GTT AGC GAT AAA TTA TCG 4 4 94 

Ala Asn Asp Leu Ser Val Lys Ser Ala Thr Val Ser Asp Lys Leu Ser 
1430 1435 1440 

CTT GGT ACA AAC GGC AAT AAA GTC AAT ATC ACA AGC GAC ACC AAA GGC 4 54 2 

Leu Gly Thr Asn Gly Asn Lys Val Asn He Thr Ser Asp Thr Lys Gly 
1445 1450 1455 1460 

TTG AAC TTC GCT AAA GAT AGT AAG ACA GGC GAT GAT GCT AAT ATT CAC 4590 
Leu Asn Phe Ala Lys Asp Ser Lys Thr Gly Asp Asp Ala Asn He His 
1465 1470 1475 

TTA AAT GGC ATT GCT TCA ACT TTA ACT GAT ACA TTG TTA AAT AGT GGT 4 6 38 

Leu Asn Gly He Ala Ser Thr Leu Thr Asp Thr Leu Leu Asn Ser Gly 
1480 1485 1490 

GCG ACA ACC AAT TTA GGT GGT AAT GGT ATT ACT GAT AAC GAG AAA AAA 4 686 

Ala Thr Thr Asn Leu Gly Gly Asn Gly He Thr Asp Asn Glu Lys Lys 
1495 1500. 1505 
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r-nr Arr GTT AAA GAT GTC TTG AAT GCG GGT TGG AAT GTT CGT 
Z A^a Sa Ser vS ^ Sp Val .eu Asn Ala Gly^Trp Asn Val Arg 

1510 ^5^^ 
^ ^ ^ ceo TCT CC. »T .T. ^ «T ..C OJC - 

Gly val Lys Pro Ala Ser Ala Asn Asn ^,J.n ^^^^ 
1525 1530 "35 



„ p.p TTT GTT AGT GGA GAT AAA GAC ACC 

Si ?S ?S ^ S= SI VaX S„ C.V «P - 

1545 ^5^" 

?s £vS s^i t:: ^ Sr- ^ ^ io^ 
?r. z ?s in ?n ill z z -I ^ ™ 

1575 15^° 

TTT ACA GGC AAA GAG CTG AAG GAT GCT AAC AAT AAT GGC GTA ACT GTT 
TZ rlr Gly I.ys Glu Leu Lys Asp Ala Asn Asn Asn^Gly Val Thr Val 

1590 15^^ 
.CC GAA ACC GAC GGC AAA GAC GAG GGT AAT GGT TTA GTG ACT GCA AAA 

Thr Glu Thr ASP Gly Lys Asp Glu Gly Asn f vj^^ ^^'^ ^^20 

1605 1610 

GCT CTG ATT GAT GCC GTG AAT AAG GCT GGT TG^ AGA GTT AAA ACA ACA 
Ala val lie Asp Ala Val Asn Lys Ala ^rp Arg vai y 

1625 1^^" 

z s -I zp -I ti zi ;ii s= ^z ™^ 
-I ^z z z zz ?s ?s z zp ^ 
z z zz iv. z z z zi z z 

1670 ^^'^^ 

r^^. nzvr rcr GAT AAA ATC GTT GCA GAC ACG ACC GTA CTT 
GGC TTA AAA CTA GAC GGC GAT AAA Aiu 

Gly Leu Lys Leu Asp Gly Asp Lys He Val ^la Asp ^^^^ 

1685 '"^ 

^^n^ ^nz, rXT ACA GCT CCG AAT AAT GGC GAT GGT AAG 
ACT GTG GCA GAT GGT AAA GTT ACA GCT 

Thr val Ala Asp Gly Lys Val Tnr Aia ^^^^ 
1705 '^^^^ 

^ r^r^r. i^rT CGT TTA GCG GAT GCG TTA AAT AAA TTA AGC 
AAA TTT GTT GAT GCA AGT GGT TTA GCG 
Lys Phe val Asp Ala Ser Gly Leu Ala Asp Ala 



4734 



47B2 



4830 



4878 



4926 



4974 



5022 



5070 



5118 



5166 



5214 



5262 



5310 



5358 



1720 ^•'^S 
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TGG ACG GCA ACT GCT GGT AAA GAA GGC ACT GGT GAA GTT GAT CCT GCA 54 06 

Trp Thr Ala Thr Ala Gly Lys Glu Gly Thr Gly Glu Val Asp Pro Ala 
1735 1740 1745 

AAT TCA GCA GGG CAA GAA GTC AAA GCG GGC GAC AAA GTA ACC TTT AAA 5454 
Asn Ser Ala Gly Gin Glu Val Lys Ala Gly Asp Lys Val Thr Phe Lys 
1750 1755 1760 

GCC GGC GAC AAC CTG AAA ATC AAA CAA AGC GGC AAA GAC TTT ACC TAC 5502 
Ala Gly Asp Asn Leu Lys lie Lys Gin Ser Gly Lys Asp Phe Thr Tyr 
1765 1770 1775 1780 

TCG CTG AAA AAA GAG CTG AAA GAC CTG ACC AGC GTA GAG TTC AAA GAC 5550 
Ser Leu Lys Lys Glu Leu Lys Asp Leu Thr Ser Val Glu Phe Lys Asp 
17B5 1790 1795 

GCA AAC GGC GGT ACA GGC AGT GAA AGC ACC AAG ATT ACC AAA GAC GGC 5598 
Ala Asn Gly Gly Thr Gly Ser Glu Ser Thr Lys lie Thr Lys Asp Gly 
1800 1805 1810 

TTG ACC ATT ACG CCG GCA AAC GGT GCG GGT GCG GCA GGT GCA AAC ACT 564 6 

Leu Thr lie Thr Pro Ala Asn Gly Ala Gly Ala Ala Gly Ala Asn Thr 
1815 1820 1825 

GCA AAC ACC ATT AGC GTA ACC AAA GAT GGC ATT AGC GCG GGT AAT AAA 56 94 

Ala Asn Thr lie Ser Val Thr Lys Asp Gly lie Ser Ala Gly Asn Lys 
1830 1835 1840 

GCA GTT ACA AAC GTT GTG AGC GGA CTG AAG AAA TTT GGT GAT GGT CAT 574 2 

Ala Val Thr Asn Val Val Ser Gly Leu Lys Lys Phe Gly Asp Gly His 
1845 1850 1855 1860 

ACG TTG GCA AAT GGC ACT GTT GCT GAT TTT GAA AAG CAT TAT GAC AAT 57 90 

Thr Leu Ala Asn Gly Thr Val Ala Asp Phe Glu Lys His Tyr Asp Asn 
1865 1870 1875 

GCC TAT AAA GAC TTG ACC AAT TTG GAT GAA AAA GGC GCG GAT AAT AAT 583 8 

Ala Tyr Lys Asp Leu Thr Asn Leu Asp Glu Lys Gly Ala Asp Asn Asn 
1880 1885 1890 

CCG ACT GTT GCC GAC AAT ACC GCT GCA ACC GTG GGC GAT TTG CGC GGC 5886 
Pro Thr Val Ala Asp Asn Thr Ala Ala Thr Val Gly Asp Leu Arg Gly 
1895 1900 1905 

TTG GGC TGG GTC ATT TCT GCG GAC AAA ACC ACA GGC GAA CCC AAT CAG 5934 
Leu Gly Trp Val lie Ser Ala Asp Lys Thr Thr Gly Glu Pro Asn Gin 
1910 1915 1920 

GAA TAC AAC GCG CAA GTG CGT AAC GCC AAT GAA GTG AAA TTC AAG AGC 5 982 

Glu Tyr Asn Ala Gin Val Arg Asn' Ala Asn Glu Val Lys Phe Lys Ser 
1925 1930 1935 1940 

GGC AAC GGT ATC AAT GTT TCC GGT AAA ACA TTG AAC GGT ACG CGC GTG 6 03 0 

Gly Asn Gly lie Asn Val Ser Gly Lys Thr Leu Asn Gly Thr Arg Val 
1945 1950 1955 
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^ ^^1. CGC GAA GTG GTT AAA TCG AAT GAA TTT 
ATT ACC TTT GAA TTG GCT AAA GGC GAA G^^ 

lie Thr Phe Glu Leu Ala Lys Giy ^ ^^,0 
1960 

.T. «T CCC 0^ - ^ .CO «C C„ ^ CT. 

Thr Val Lys Asn Ala Asp Gly Ser^t.x ^^^^ 

OAT ATG TtAC AGC AAA GAG GAT ATT GAC CCG GCA ACC AGT AAA CCG 
ASP Met Tyr Tyr Ser Lys Glu Asp He Asp ^^^^ 
1990 

-.n. T.r.r rTT GAA AAC GGC AAA GTC GTT 
ATG ACA GGT AAA ACT GAA ^ TAT AAG GTT GAA 

Met Thr Gly Lys Thr Glu Lys Tyr Lys ^^^^ 2020 

2005 2010 

.CT GCT AAC GGC AGC AAG ACC GAA GTT ACC CTA ACC AAC AAA GGT TCC 
ser Ala Asn Gly Ser Lys Thr Glu Val Tn ^^^^ 
2025 

T« C« COT «C C« .TO OCT 0« OCO .TT OCO ^ T» OOC 

Gly Tyr Val Thr Gly Asn Gin Val ^la Asp ^^^^ 
2040 

^ 3.0 OOT TXO C« ==0 OC. 0« OCT 0» ^ OCC TTT 0» 

Phe Glu Leu Gly Leu Ala Asp Ala Ala ex ^^^^ 
2055 2060 

^ T^r TCT AAA GAT AAA GCG GAA ACT GTA 

GAA AGC GCA AAA GAC AAG CAA TTG TCT AAA G 
Glu ser Ala Lys Asp Lys Gin Leu Ser Lys A p ^y^^ 
2070 2075 

3CC C.C O.T «C CO. TTT OCT »T COT TT. ^T .CC ^ OTO 

Asn Ma His MP I.y» ^^^^/'^ 2095 

Toco OC. .CO OTO 0« ^= .CT 0« OC. «C OOC 0.T ^ OTO .CC 

ser Ala Ala Thr Val Glu Ser Thr Asp a ^^^^ 
2105 "^^"^^ 

«,rT,^ /-r"r TT^Zv ACG CAA ATC TAC 

?S S vl^^^t:^ tS l^J^ - 1- - 

.CC «T OOT «T »0 .TC OTT ^ ^ OCT O^.C OO. ^ 

Asn Thr Asp Ala Asn Gly Asn ^y^^ 2i4 5 

_ TcTO »T OC. O.T OCT .CO O^CO .OT ^C ^ 0„ OTO .C. 

Trp Tyr Glu Leu Asn Ala ^^P^"^ ^ 2160 

T«C OTO 0.T OC. ..C OOT «0 ^ OTT OT= ^ OT. .CC 0» 
Oly Mn v,l MP .la Mn Gly ^^^^ 218 



6078 



6126 



6174 



6222 



6270 



6318 



6366 



64.14 



6462 



6510 



6558 



6606 



6654 



6702 



2165 
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AAT GGT GCG GAT AAG TGG TAT TAC ACC AAT GCT GAC GGT GCT GCG GAT 6 750 

Asn Gly Ala Asp Lys Trp Tyr Tyr Thr Asn Ala Asp Gly Ala Ala Asp 
2185 2190 2195 

AAA ACC AAA GGC GAA GTG AGC AAT GAT AAA GTT TCT ACC GAT GAA AAA 67 98 

Lys Thr Lys Gly Glu Val Ser Asn Asp Lys Val Ser Thr Asp Glu Lys 
2200 2205 2210 

CAC GTT GTC CGC CTT GAT CCG AAC AAT CAA TCG AAC GGC AAA GGC GTG 6846 
His Val Val Arg Leu Asp Pro Asn Asn Gin Ser Asn Gly Lys Gly Val 
2215 2220 2225 

GTC ATT GAC AAT GTG GCT AAT GGC GAA ATT TCT GCC ACT TCC ACC GAT 6894 
Val lie Asp Asn Val Ala Asn Gly Glu He Ser Ala Thr Ser Thr Asp 
2230 2235 2240 

GCG ATT AAC GGA ACT CAG TTG TAT GCC GTG GCA AAA GGG GTA ACA AAC 6 942 

Ala He Asn Gly Ser Gin Leu Tyr Ala Val Ala Lys Gly Val Thr Asn 
2245 2250 2255 2260 

CTT GCT GGA CAA GTG AAT AAT CTT GAG GGC AAA GTG AAT AAA GTG GGC 6 990 

Leu Ala Gly Gin Val Asn Asn Leu Glu Gly Lys Val Asn Lys Val Gly 
2265 2270 2275 

AAA CGT GCA GAT GCA GGT ACA GCA AGT GCA TTA GCG GCT TCA CAG TTA 703 8 

Lys Arg Ala Asp Ala Gly Thr Ala Ser Ala Leu Ala Ala Ser Gin Leu 
2280 2285 2290 

CCA CAA GCC ACT ATG CCA GGT AAA TCA ATG GTT GCT ATT GCG GGA AGT 7 08 6 

Pro Gin Ala Thr Met Pro Gly Lys Ser Met Val Ala He Ala Gly Ser 
2295 2300 2305 

AGT TAT CAA GGT CAA AAT GGT TTA GCT ATC GGG GTA TCA AGA ATT TCC 7134 
Ser Tyr Gin Gly Gin Asn Gly Leu Ala He Gly Val Ser Arg He Ser 
2310 2315 2320 

GAT AAT GGC AAA GTG ATT ATT CGC TTG TCA GGC ACA ACC AAT AGT CAA 7182 
Asp Asn Gly Lys Val He He Arg Leu Ser Gly Thr Thr Asn Ser Gin 
2325 2330 2335 2340 

GGT AAA ACA GGC GTT GCA GCA GGT GTT GGT TAC CAG TGG TAAAGTTTGG 72 31 

Gly Lys Thr Gly Val Ala Ala Gly Val Gly Tyr Gin Trp 
2345 2350 

ATTATCTCTC TTAAAAAGCG GCATTTGCCG CTTTTTTTAT GGGTGGCTAT TATGTATCGT 72 91 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 353 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

w . « val lie Trp Asn Val Met Thr Gin Thr Trp 
Met Asn Lvs lie Phe Asn Val He Trp ^5 

1 ^ 

T Thr Arg Thr His Thr Lys Arg Ala Ser Ala 
val Val val Ser Glu Leu Thr ^ 30 

20 

Thr Leu Leu Phe Ala Thr Val Gin 
Thr val Glu Thr Ala Val Leu Ala Thr Leu L 

35 

r.-, r-i» Leu ASP Pro Val Val Arg Thr 
Ala Asn Ala Thr Asp Glu Asp Glu Glu Leu Asp 

. T , ser Phe His ser Asp Lys Glu Gly Thr Gly Glu Lys 
Ala Pro Val Leu Ser Pne 8 0 

65 "^^ 

rlv lie Tyr Phe Asp Asn Lys Gly 
Glu val Thr Glu Asn Ser Asn Trp Gly He Ty 
85 

z.i» Tie Thr Leu Lys Ala Gly Asp Asn Leu Lys 
val Leu Lys Ala Gly Ala He Thr L.e^ y 

100 

...t. Glu ser Thr Asn Ala Ser Ser Phe Thr Tyr 
lie Lys Gin Asn Thr Asp Glu Ser 
115 

ser Leu Lys Lys Asp Leu Thr Asp Leu Thr Ser Val Ala Thr Glu Lys 

.eu Z Phe Gly Ala Asn Gly Asp Lys Val Asp Xle Thr Ser Asp Ala 
145 

, T Glv Asn Gly Asn Val His Leu Asn 

Asn Gly Leu Lys Leu Ala Lys Thr Gly Asn y 
165 

s,. ... .e. «P - V,. ... «n ... OX. v.. 
se, se. s,. r. ... ... - - - Ifs 

195 

,.=r, Ma Glv Trp Asn He Lys Gly Ala Lys 
Thr val Lys Asp Val Leu Asn Ala Gly Trp 

210 

«a .XV =1. - 0.. s„ v.. .sp v.. se. Tvr 

Z va. - -V - - :u - 

... ..a .vs - O.. .V. - va. .VS P.- - - - 

260 

*=r, Glv LVS Leu Phe Thr Gly Lys Glu 
Thr ser Val He Lys Glu Lys AsP Gly Lys 

275 
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Asn Asn Asp Thr Asn Lys Val Thr Ser Asn Thr Ala Thr Asp Asn Thr 

290 295 300 

Asp Glu Gly Asn Gly Leu Val Thr Ala Lys Ala Val lie Asp Ala Val 
305 310 315 320 

Asn Lys Ala Gly Trp Arg Val Lys Thr Thr Thr Ala Asn Gly Gin Asn 
325 330 335 

Gly Asp Phe Ala Thr Val Ala Ser Gly Thr Asn Val Thr Phe Glu Ser 
340 345 350 

Gly Asp Gly Thr Thr Ala Ser Val Thr Lys Asp Thr Asn Gly Asn Gly 
355 360 365 

lie Thr Val Lys Tyr Asp Ala Lys Val Gly Asp Gly Leu Lys Phe Asp 
370 375 380 

Ser Asp Lys Lys lie Val Ala Asp Thr Thr Ala Leu Thr Val Thr Gly 
385 390 395 400 

Gly Lys Val Ala Glu lie Ala Lys Glu Asp Asp Lys Lys Lys Leu Val 
405 410 415 

Asn Ala Gly Asp Leu Val Thr Ala Leu Gly Asn Leu Ser Trp Lys Ala 
420 425 430 

Lys Ala Glu Ala Asp Thr Asp Gly Ala Leu Glu Gly lie Ser Lys Asp 
435 440 445 

Gin Glu Val Lys Ala Gly Glu Thr Val Thr Phe Lys Ala Gly Lys Asn 
450 455 460 

Leu Lys Val Lys Gin Asp Gly Ala Asn Phe Thr Tyr Ser Leu Gin Asp 
465 470 475 4B0 

Ala Leu Thr Gly Leu Thr Ser lie Thr Leu Gly Gly Thr Thr Asn Gly 
485 490 495 

Gly Asn Asp Ala Lys Thr Val He Asn Lys Asp Gly Leu Thr He Thr 
500 505 510 

Pro Ala Gly Asn Gly Gly Thr Thr Gly Thr Asn Thr He Ser Val Thr 
515 520 525 

Lys Asp Gly He Lys Ala Gly Asn Lys Ala He Thr Asn Val Ala Ser 
530 535 540 

Gly Leu Arg Ala Tyr Asp Asp Ala Asn Phe Asp Val Leu Asn Asn Ser 
545 550 555 560 

Ala Thr Asp Leu Asn Arg His Val Glu Asp Ala Tyr Lys Gly Leu Leu 
565 570 575 

Asn Leu Asn Glu Lys Asn Ala Asn Lys Gin Pro Leu Val Thr Asp Ser 
580 585 590 
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Thr Ala Ala Thr Val Oly Asp Leu Arg l.ys Leu Gly Trp Val Val Ser 

595 

Thr Lys Asn Gly Thr Lys Glu Glu Ser Asn Gin Val Lys Gin Ala Asp 

610 

alu val Leu Phe Thr Gly Ala Gly Ala Ala Thr Val Thr Ser Lys Ser 
625 

Glu Asn Gly Lys His Thr He Thr Val Ser Val Ala Glu Thr Lys Ala 

645 

«p cys Gly ..u Olu Lys Gly «P Thr 11= Lys I-» Lys val «p 

660 



;,sn Gin Asn Thr Asp Asn Val Leu Thr Val Gly Asn Asn Gly Thr Ala 

675 

val Thr Lys Gly Gly Phe Glu Thr Val Lys Thr Gly Ala Thr Asp Ala 

690 

ASP Arg Gly Lys Val Thr Val Lys Asp Ala Thr Ala Asn Asp Ala Asp 
705 ■'lO ^"-^ 

Lys Lys val Ala Thr Val Lys Asp Val Ala Thr Ala He Asn Ser Ala 

725 '^^^ 

Ala Thr Phe Val Lys Thr Glu Asn Leu Thr Thr Ser He Asp Glu Asp 
740 ■^'IS 

Asn pro Thr Asp Asn Gly Lys Asp Asp Ala Leu Lys Ala Gly Asp Thr 

755 "^^^ 

X.eu Thr Phe Lys Ala Gly Lys Asn Leu Lys Val Lys Arg Asp Gly Lys 

770 

Asn He Thr Phe Asp Leu Ala Lys Asn Leu Glu Val Lys Thr Ala Lys 
785 790 755 

val ser Asp Thr Leu Thr He Gly Gly Asn Thr Pro Thr Gly Gly Thr 
805 ^-^^ 



Thr Ala Thr Pro Lys 
820 



val Asn lie Thr Ser Thr Ala Asp Gly Leu Asn 



825 



830 



Gly ser Lys Asn Val Tyr Leu 



Phe Ala Lys Glu Thr Ala Asp Ala Ser - 
835 

Lys =ly lie Al. Thr Thr Leu Thr Glu Pro .er Al. Gly Ala Lys ser 
B50 

Ser Asn Ala Ala 

__ ^ * T All iven v/a >«^L^ j-kj.«s». — J — * 

Ser 

865 870 



His val ASP Leu Asn Val Asp Ala Thr Lys Lys Ser Asn 
870 

r-i^ tto Asn He Gin Gly Asn Gly 
ser He Glu Asp Val Leu Arg Ala Gly Trp Asn 



885 
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Asn Asn Val Asp Tyr Val Ala Thr Tyr Asp Thr Val Asn Phe Thr Asp 
900 905 910 

Asp Ser Thr Gly Thr Thr Thr Val Thr Val Thr Gin Lys Ala Asp Gly 
915 920 925 

Lys Gly Ala Asp Val Lys lie Gly Ala Lys Thr Ser Val lie Lys Asp 
930 935 940 

His Asn Gly Lys Leu Phe Thr Gly Lys Asp Leu Lys Asp Ala Asn Asn 
945 950 955 960 

Gly Ala Thr Val Ser Glu Asp Asp Gly Lys Asp Thr Gly Thr Gly Leu 
965 970 975 

Val Thr Ala Lys Thr Val lie Asp Ala Val Asn Lys Ser Gly Trp Arg 
980 985 990 

Val Thr Gly Glu Gly Ala Thr Ala Glu Thr Gly Ala Thr Ala Val Asn 
995 1000 1005 

Ala Gly Asn Ala Glu Thr Val Thr Ser Gly Thr Ser Val Asn Phe Lys 
1010 1015 1020 

Asn Gly Asn Ala Thr Thr Ala Thr Val Ser Lys Asp Asn Gly Asn He 
3-025 1030 1035 1040 

Asn Val Lys Tyr Asp Val Asn Val Gly Asp Gly Leu Lys He Gly Asp 
1045 1050 1055 

Asp Lys Lys He Val Ala Asp Thr Thr Thr Leu Thr Val Thr Gly Gly 
1060 1065 1070 

Lys Val Ser Val Pro Ala Gly Ala Asn Ser Val Asn Asn Asn Lys Lys 
1075 1080 1085 

Leu Val Asn Ala Glu Gly Leu Ala Thr Ala Leu Asn Asn Leu Ser Trp 
1090 1095 1100 

Thr Ala Lys Ala Asp Lys Tyr Ala Asp Gly Glu Ser Glu Gly Glu Thr 
1105 1110 1115 1120 

Asp Gin Glu Val Lys Ala Gly Asp Lys Val Thr Phe Lys Ala Gly Lys 
1125 1130 1135 

Asn Leu Lys Val Lys Gin Ser Glu Lys Asp Phe Thr Tyr Ser Leu Gin 
1140 1145 1150 

Asp Thr Leu Thr Gly Leu Thr Ser He Thr Leu Gly Gly Thr Ala Asn 
1155 1160 1165 

Gly Arg Asn Asp Thr Gly Thr Val He Asn Lys Asp Gly Leu Thr He 
1170 1175 1180 

Thr Leu Ala Asn Gly Ala Ala Ala Gly Thr Asp Ala Ser Asn Gly Asn 
1185 1190 1195 1200 
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... n. s„ V,. .«^.vs o.. x.e se.^..a ..V - 

V. r.. ... - - - - ?^3„"" 

^ .V. - - if^r - 

1235 

TvTa THr Val Ser Ala Lys 

Olu val Olu Phe va. O.y^.ys - oxy 

r;.n - ^' 
.y «y - - - ""'^^ 

^ 1285 

T i.eu Thr val Asp Ala Thr 

:.vs val «P " 

1300 

... .y .a s„ V.1 .a - - ^ 

r«a =1. 31y -o"^ 
..1 ir^ly s,. «v^"» - 

:r.a val Cly val .1. .y^ ^ ^ ^3'^ 

1365 

..a val .y= val Ol. «n «P S«^,"» ^ f^lo"' 

r.i. - ^" 

t.eu LVS val Lys Arg Asp Gly Lys Asn He 
l.eu Lys Ala Gly Lys Asn Leu^l^ys 

. .Xa Asn ASP .eu Se. Val .ys Ser Ma T.r Val Ser^ 
Thr Phe Ala Leu Ala Asn^A p ^^^^ 

s« «u Oly T« .an Oly .y= val .a. 

ASP Lys Leu Ser Leu ^xy ^^^^ 
^ 1445 

T ^,o Thr Gly Asp Asp 

.ys .ly - " 

. na «n 31y XI. "a Sa. -^"^ ™' 

Ala Asn iJ-e "-^=' 14B0 

. Thr Thr Asn Leu Gly Gly Asn Gly He Thr Asp 
I,eu Asn Ser Gly Ala Thr Thr^As ^^^^ 
1490 
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Asn Glu Lys Lys Arg Ala Ala Ser Val Lys Asp Val Leu Asn Ala Gly 

Tcin 1515 1520 

1505 ISIO J.3J.3 

Trp Asn Val Arg Gly Val Lys Pro Ala Ser Ala Asn Asn Gin Val Glu 

1525 1530 1535 

Asn He Asp Phe Val Ala Thr Tyr Asp Thr Val Asp Phe Val Ser Gly 
1540 1545 1550 

Asp Lys Asp Thr Thr Ser Val Thr Val Glu Ser Lys Asp Asn Gly Lys 
1555 1560 1565 

Arg Thr Glu Val Lys He Gly Ala Lys Thr Ser Val He Lys Asp His 
1570 1575 1580 

Asn Gly Lys Leu Phe Thr Gly Lys Glu Leu Lys Asp Ala Asn Asn Asn 
1585 1590 1595 1600 

Gly Val Thr Val Thr Glu Thr Asp Gly Lys Asp Glu Gly Asn Gly Leu 
1605 1610 1615 

val Thr Ala Lys Ala Val He Asp Ala Val Asn Lys Ala Gly Trp Arg 
1620 1625 1S30 

Val Lys Thr Thr Gly Ala Asn Gly Gin Asn Asp Asp Phe Ala Thr Val 
1635 1640 1645 

Ala Ser Gly Thr Asn Val Thr Phe Ala Asp Gly Asn Gly Thr Thr Ala 
1650 1655 1660 

Glu Val Thr Lys Ala Asn Asp Gly Ser He Thr Val Lys Tyr Asn Val 
1665 1670 1675 1680 

Lys Val Ala Asp Gly Leu Lys Leu Asp Gly Asp Lys He Val Ala Asp 
1685 1690 1695 

Thr Thr Val Leu Thr Val Ala Asp Gly Lys Val Thr Ala Pro Asn Asn 
1700 1705 1710 

Gly Asp Gly Lys Lys Phe Val Asp Ala Ser Gly Leu Ala Asp Ala Leu 
1715 1720 1725 

Asn Lys Leu Ser Trp Thr Ala Thr Ala Gly Lys Glu Gly Thr Gly Glu 
1730 1735 1740 

Val Asp Pro Ala Asn Ser Ala Gly Gin Glu Val Lys Ala Gly Asp Lys 
1745 1750 1755 1760 

Val Thr Phe Lys Ala Gly Asp Asn Leu Lys He Lys Gin Ser Gly Lys 
1765 1770 1775 

Asp Phe Thr Tyr Ser Leu Lys Lys Glu Leu Lys Asp Leu Thr Ser Val 
1780 1785 1790 

Glu Phe Lys ASP Ala Asn Gly Gly Thr Gly Ser Glu Ser Thr Lys He 
1795 1800 1805 
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Thr Pro Ala Asn Gly Ala Gly Ala Ala 
Thr Lys ASP Gly Leu Thr ^hr Pro ax ^^^^ 

IBIO 

^v,^ Tie ser Val Thr Lys Asp Gly He Ser 
Gly Ala Asn Thr Ala Asn Thr He Ser ^^^^ ^3^0 

ox. .s„ ... v.. «n v.. v.. - -vs - 

1845 

^sn Glv Thr val Ala Asp Phe Glu Lys 
Gly ASP Gly His Thr Leu Ala Asn Gly^Th ^^^^ 
I860 

. T.VS ASP Leu Thr Asn Leu Asp Glu Lys Gly 

His Tyr ASP Asn Ala Tyr Lys Asp^L ^^^^ 

a^v,. val Ala Asp Asn Thr Ala Ala Thr Val Gly 
Ala ASP Asn Asn Pro Thr ^la A p ^^^^ 

1890 ^^^^ 



... «y v.. .1. s« ..P ^rs TH. ™. o.v^ 

1910 

. ,.1=, Gin val Arg Asn Ala Asn Glu Val 

Gin Glu Tyr Asn Ala Gin '^'^9 3.935 



Glu pro Asn GXn ^xu xyx — 
1925 



r-T^ Tie Asn Val Ser Gly i-ys j^"* 

Lys Phe I.ys Ser Gly Asn Gly He Asn ^^^^ 

1940 

TVS Glv Glu Val Val Lys 
T XI « THr Phe Glu Leu Ala Lys trxy 
Gly Thr Arg val He Thr Pne ^^^^ ^^^^ 

... l'r..e X« vaX .V. n»0=" 

1970 ^ 



Met Tyr Tyr Ser Lys Glu Asp He Asp Pro Ala^ 



val Lys val Gly Asp Mer^iT- - ^^^^ 



s.. ... «o ... cav - - - - r 

2005 



Ala Asn Gly Ser Lys Thr Glu Val Thr^Leu Thr 



Gly Lys val Val Ser „ - 

2020 

... "V ^ v.. v.. «.^^P 

... r... ... ... -V 

... r..e «. =XU se.^... .V= - - ^r,0 

r=.u V,. 1 - - -0"^ 

2085 

z.ia Thr val Glu ser Thr Asp Ala Asn Gly 
Asn Thr Lys val Ser Ala Ala Thr VaX^ ^^^^ 
2100 
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Asp Lys Val Thr Thr Thr Phe Val Lys Thr Asp Val Glu Leu Pro Leu 
2115 2120 2125 

Thr Gin lie Tyr Asn Thr Asp Ala Asn Gly Asn Lys lie Val Lys Lys 
2130 2135 5140 

Ala Asp Gly Lys Trp Tyr Glu Leu Asn Ala Asp Gly Thr Ala Ser Asn 
2145 2150 2155 2160 

Lys Glu Val Thr Leu Gly Asn Val Asp Ala Asn Gly Lys Lys Val Val 
2165 2170 2175 

Lys Val Thr Glu Asn Gly Ala Asp Lys Trp Tyr Tyr Thr Asn Ala Asp 
2180 2185 2190 

Gly Ala Ala Asp Lys Thr Lys Gly Glu Val Ser Asn Asp Lys Val Ser 
2195 2200 2205 

Thr Asp Glu Lys His Val Val Arg Leu Asp Pro Asn Asn Gin Ser Asn 
2210 2215 2220 

Gly Lys Gly Val Val lie Asp Asn Val Ala Asn Gly Glu lie Ser Ala 
2225 2230 2235 2240 

Thr Ser Thr Asp Ala lie Asn Gly Ser Gin Leu Tyr Ala Val Ala Lys 
2245 2250 2255 

Gly Val Thr Asn Leu Ala Gly Gin Val Asn Asn Leu Glu Gly Lys Val 
2260 2265 2270 

Asn Lys Val Gly Lys Arg Ala Asp Ala Gly Thr Ala Ser Ala Leu Ala 
2275 2280 2285 

Ala Ser Gin Leu Pro Gin Ala Thr Met Pro Gly Lys Ser Met Val Ala 
2290 2295 2300 

lie Ala Gly Ser Ser Tyr Gin Gly Gin Asn Gly Leu Ala lie Gly Val 
2305 2310 2315 2320 

Ser Arg lie Ser Asp Asn Gly Lys Val lie lie Arg Leu Ser Gly Thr 
2325 2330 2335 

Thr Asn Ser Gin Gly Lys Thr Gly Val Ala Ala Gly Val Gly Tyr Gin 
2340 2345 2350 

Trp 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : xmknown 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: protein 

SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



..e V. V. V. ... - - 

L V.X V,. se. L - - - - f: 

20 

Thr Leu Leu Ser Ala Thr Val Glu 
Thr val Ala Val Ala Val Leu Ala Thr 

v.l Thr Asn Lys Leu Lys Ala Tyr Gly Asp 
Ala Asn Asn Asn Thr Pro Val Thr Asn y 

1 P., ^» - - - - f.' - 

1 ... ^ ^v. - - « 

85 

TV ^ Thr Ala Ala Thr Val Gly Asn 
ser ASP Lys Leu Leu Val Glu Asp Asn Thr 
100 

n T»„ ser ser Lys Asn Gly Thr Arg Asn 
Leu Arg Lys Leu Gly Trp Val Leu Ser 

u Rla ASP Glu val Leu Phe Glu Gly 
1-1 r. Val Lvs His Aia Asp "-^ 
Glu Lys ser Gin Gin vai uy^ 

Z v.. v.. se. S« .XU «n O.V X.. 

... 1 - - V. ... - - V. 

... r. =.v - - -V - - 

... ..o .vs r. v.. ... |« - - - 

195 

-nw^ tht- Val His Leu Asn Gly 
^^P «a «^ ™' Ifo 

T7=.i riv ser Pro Ala Thr His 

Xle Gly se. ... t\l 

230 

c Thr His Tyr Thr Arg Ala Ala Ser He 
lie ASP Gly Gly Asp Gin Ser Thr His Ty^ 
245 

Ti« TVS Glv Val Lys Ala Gly 

.sp V.1 .=u «n ..P 

260 
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Ser Thr Thr Gly Gin Ser Glu Asn Val Asp Phe Val His Thr Tyr Asp 
275 280 285 

Thr Val Glu Phe Leu Ser Ala Asp Thr Glu Thr Thr Thr Val Thr Val 
290 295 300 

Asp Ser Lys Glu Asn Gly Lys Arg Thr Glu Val Lys lie Gly Ala Lys 
305 310 315 320 

Thr Ser Val lie Lys Glu Lys Asp Gly Lys Leu Phe Thr Gly Lys Ala 
325 330 335 

Asn Lys Glu Thr Asn Lys Val Asp Gly Ala Asn Ala Thr Glu Asp Ala 
340 345 350 

Asp Glu Gly Lys Gly Leu Val Thr Ala Lys Asp Val He Asp Ala Val 
355 360 365 

Asn Lys Thr Gly Trp Arg He Lys Thr Thr Asp Ala Asn Gly Gin Asn 
370 375 380 

Gly Asp Phe Ala Thr Val Ala Ser Gly Thr Asn Val Thr Phe Ala Ser 
385 390 395 400 

Gly Asn Gly Thr Thr Ala Thr Val Thr Asn Gly Thr Asp Gly He Thr 
405 410 415 

Val Lys Tyr Asp Ala Lys Val Gly Asp Gly Leu Lys Leu Asp Gly Asp 
420 425 430 

Lys He Ala Ala Asp Thr Thr Ala Leu Thr Val Asn Asp Gly Lys Asn 
435 440 445 

Ala Asn Asn Pro Lys Gly Lys Val Ala Asp Val Ala Ser Thr Asp Glu 
450 455 460 

Lys Lys Leu Val Thr Ala Lys Gly Leu Val Thr Ala Leu Asn Ser Leu 
465 470 475 480 

Ser Trp Thr Thr Thr Ala Ala Glu Ala Asp Gly Gly Thr Leu Asp Gly 
485 490 495 

Asn Ala Ser Glu Gin Glu Val Lys Ala Gly Asp Lys Val Thr Phe Lys 
500 505 510 

Ala Gly Lys Asn Leu Lys Val Lys Gin Glu Gly Ala Asn Phe Thr Tyr 
515 520 525 

Ser Leu Gin Asp Ala Leu Thr Gly Leu Thr Ser He Thr Leu Gly Thr 
530 535 540 

Gly Asn Asn Gly Ala Lys Thr Glu He Asn Lys Asp Gly Leu Thr He 
545 550 555 560 

Thr Pro Ala Asn Gly Ala Gly Ala Asn Asn Ala Asn Thr He Ser Val 
565 570 575 
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„ Glv Gly Gin ser val Ws As" v.l val 

Thr Lys ASP Gly He ser Ala Gly Gly 

Gly .e. r= Gly Ala Aa„ Aa. 1.U S„ 

3„ Al. .a. ... ..J - - - - - - 

T c= rTn Thr pro Val Val Ala 
^<=r. Glu Lys Gly Thr Asp Lys Gin Thr 
Thr Asn Leu Asp Glu uy^ j ^3 5 

Z A." - - - - - - "O - 

645 

lie Ser 

(2) INFORMATION FOR SEQ ID NO-.G: 

(i> SEQUENCE CHARACTERISTICS: 

^ ^ (A) LENGTH: 6 07 amino acxds 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: protein 

SEQUENCE description: SEQ ID NO: 6: 

. Ti. Trp Asn val Met Thr Gin Thr Trp 
. ^ ws lie Phe Asn Val He Trp Asn 
Met Asn Lys XJ-e 



^ THr- His Thr Lys Arg Leu Arg Asn 

val val val Ser Glu Leu Thr Arg Thr H.s 
20 

^la Thr Leu Leu Phe Ala Thr Val Gin Ala 
Arg Gly Asp Pro Val Leu Ala Thr 

A. .la 1 ASP Glu ASP Glu Glu Leu Asp P.o Val Val Arg Thr Ala 
1 Leu ser Phe H.s Ser Asp Lys Glu Gly Thr Gly Glu Lys Glu 

" . alv lie Tyr Phe Asp Asn Lys Gly Val 

val Thr Glu Asn Ser Asn Trp Gly TV 

Ala Gly 1 I- Thr Leu Lys Ala Gly Asp Asn Leu Lys Xaa 

Leu Lys Ala Gly a-lu 
100 

Tvio ^f>T ser Phe Thr Tyr Ser 
Gin xa. Tnr Asp Gin Xaa T« Asn Ala S.r 
lis 

, Thr ser val Ala Thr Glu Lys Leu 
.eu Lys Lys Asp Leu Thr Asp .eu Thr 



135 
130 
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ser Phe Gly Ala Asn Gly Asp Lys Val Asp He Thr Ser Asp Ala Asn 
145 150 155 IGO 

Gly Leu Lys Leu Ala Lys Thr Gly Asn Gly Asn Val His Leu Asn Gly 
165 l^O 175 

Leu Asp ser Thr Leu Pro Asp Ala Val Thr Asn Thr Gly Val Leu Ser 
180 185 190 

Ser ser Ser Phe Thr Pro Asn Asp Val Glu Lys Thr Arg Ala Ala Thr 
195 200 205 

Val Lys Asp Val Leu Asn Ala Gly Trp Asn He Lys Gly Ala Lys Thr 
210 215 220 

Ala Gly Gly Asn Val Glu Ser Val Asp Leu Val Ser Ala Tyr Asn Asn 



225 



230 235 240 



Val Glu Phe He Thr Gly Asp Lys Asn Thr Leu Asp Val Val Leu Thr 
245 250 255 

Ala Lys Glu Asn Xaa Lys Thr Thr Glu Val Lys Phe Thr Pro Lys Thr 
260 265 270 

Ser Val He Lys Glu Lys Asp Gly Lys Leu Phe Thr Gly Lys Glu Asn 
275 280 285 

Asn Asp Thr Asn Lys Val Thr Ser Asn Thr Ala Thr Asp Asn Thr Asp 
290 295 300 

Glu Gly Asn Gly Leu Val Thr Ala Lys Ala Val He Asp Ala Val Asn 
305 310 315 320 

Lys Ala Gly Trp Arg Val Lys Thr Thr Thr Ala Asn Gly Gin Asn Gly 
325 330 335 

ASP Phe Ala Thr Val Ala Ser Gly Thr Asn Val Thr Phe Glu Ser Gly 
340 345 350 

Asp Gly Thr Thr Ala Ser Val Thr Lys Asp Thr Asn Gly Asn Gly He 
355 360 365 

Thr val Lys Tyr Asp Ala Lys Val Gly Asp Gly Leu Lys Phe Asp Ser 
370 375 380 



ASP Lys Lys He Val Ala Asp Thr Thr Ala Leu Thr Val. Thr Gly Gly 
385 390 395 400 

Lvs val Ala Glu He Ala Lys Glu Asp Asp Lys Lys Lys Leu Val Asn 
405 - 410 415 

Ala Gly Asp Leu Val Thr 



405 

Ala Leu Gly Asn Leu Ser Trp Lys Ala Lys 



420 



425 430 



Ala Glu Ala Asp Thr Asp Gly Ala Leu Glu Gly He Ser Lys Asp Gin 
435 440 445 
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01„ V.1 .ys «. Oly 01„ T« val Th, Phe .y= Ala Gly .y. «„ 

450 

.1 t)h«» Thr Tvr Ser Leu Gin Asp Ala 

Lvs val Lys Gin Asp Gly Ala Asn Phe Thr ^yr :»e 

465 

^ TV,,- Leu Glv Gly Thr Thr Asn Gly Gly 

Leu Thr Gly Leu Thr Ser He Thr Leu ^ ^ 

485 

.sn «p .la .ys val 11a «n .ya ^.P =ly T« lie Th, Pro 

„a =ly «n liy Oly Oly T.. «n Thr 11. Se. val Th. .ys 

515 

TvT^ Ti*k ThT- Asn Val Ala Ser Gly 
ASP Gly He Lys Ala Gly Asn Lys Ala He Thr Asn 

530 

.eu Ars Ala Tyr Asp Asp Ala Asn Phe Asp Val Leu Asn Asn Ser Ala 
545 550 

Xhr ASP Leu Asn Arg His Val Glu Asp Ala Tyr Lys Gly Leu Leu Asn 

565 

Leu Asn Glu Lys Asn Ala Asn Lys Gin Pro Leu Val Thr Asp ser Thr 

580 

AX. Al. Thr val Oly AsP .ya -au Gly Trp val val Sa. 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECXJLE TYPE: protein 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

„a. Asn .y= na val lie Trp «n val Thr cm Thr Trp 

1 ^ 

Val val val Ser Glu Leu Thr Arg 
20 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



BNSDOCID: <WO 963051 9A1 I > 



wo 96/30519 



PCT/US96/04031 



79 

(ii) MOLECtn^E TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : B : 

Met Asn Lys lie Phe Asn Val lie Trp Ash Val Val Thr Gin Thr Trp 
15 10 15 

Val Val Val Ser Glu Leu Thr Arg 

20 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met Asn Lys lie Tyr Arg Leu Lys Phe Ser Lys Arg Leu Asn Ala Leu 
15 10 15 

Val Ala Val Ser Glu Leu Ala Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 10: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Asn Lys He Tyr Arg Leu Lys Phe Ser Lys Arg Leu Asn Ala Leu 
15 10 15 

Val Ala Val Ser Glu Leu Ala Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

c lie Trp Ser His Ser Arg Gin Ala Trp 

Met Asn Lys Ala Tyr Ser He He Trp 



1 5 



lie val Ala Ser Glu Leu Ala Arg 



20 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknovm 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

T.^rr TVT- Ser Ala Val Ala Arg Gly Phe 
Met Asn Arg He Tyr Ser Leu Arg Tyr Ser A 



1 5 



He Ala val Ser Glu Phe Ala Arg 



20 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 

. Tie Tvr Tyr Leu Lys Tyr Cys His He Thr Lys Ser Leu 
Met Asn Lys He Tyr i-yr i^c j 15 



1 5 



He Ala val Ser Glu Leu Ala Arg 



20 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2037 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOI/DGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
ATGAACAAAA TTTTTAACGT TATTTGGAAT GTTGTGACTC AAACTTGGGT TGTCGTATCT 
GAACTCACTC GCACCCACAC CAAATGCGCC TCCGCCACCG TGGCAGTTGC CGTATTGGCA 
ACCCTGTTGT CCGCAACGGT TCAGGCGAAT GCTACCGATG AAAACGAAGA TGATGAAGAA 
GAGTTAGAAC CCGTACAACG CTCTGTTTTA AGGTGGAGCT TCAAATCCGC TAAGGAAGGC 
ACTGGAGAAC AAGAGGGAAC AACAGAGGTA ATAAATTTGA ACACAGATTC ATCAGGAAAT 
GCAGTAGGAA GCAGCACAAT CACCTTCAAA GCCGGCGACA ACCTGAAAAT CAAACAAAGC 
GGCAATGACT TCACCTACTC GCTGAAAAAA GAGCTGAAAA ACCTGACCAG TGTTGAAACT 
GAAAAATTAT CGTTTGGCGC AAACGGCAAT AAAGTTGATA TTACCAGTGA TGCAAATGGC 
TTGAAATTGG CGAAAACAGG TAACGGAAAT GGTCAAAACA GTAATGTTCA CTTAAACGGT 
ATTGCTTCGA CTTTGACCGA TACGCTTGCC GGTGGCACAA CAGGACACGT TGACACCAAC 
ATTGATGCGG TTAATTATCA TCGCGCTGCA AGCGTACAAG ATGTGTTAAA CAGCGGTTGG 
AATATCCAAG GCAATGGAAA CAATGTCGAT TTTGTCCGTA CTTACGACAC CGTGGACTTT 
GTCAATGGCG CGAATGCCAA TGTGAGCGTT ACGGCTGATA CGGCTCACAA AAAGACAACT 
GTCCGTGTGG ATGTAACAGG CTTGCCGGTT CAATATGTTA CGGAAGACGG CAAAACCGTT 
GTGAAAGTGG GCAATGAGTA TTACAAAGCC AAAGATGACG GTTCGGCGGA TATGAATCAA 
AAAGTCGAAA ACGGCGAGCT GGCGAAAACC AAAGTGAAAT TGGTATCGGC AAGCGGTACA 
AATCCGGTGA AAATTAGCAA TGTTGCAGAC GGCACGGAAG ACACCGATGC GGTCAGCTTT 
AAGCAATTAA AAGCCTTGCA AGACAAACAG GTTACGTTGA GCACGAGCAA TGCTTATGCC 
AATGGCGGTA CAGATAACGA CGGCGGCAAG GCAACTCAAA CTTTAAGCAA TGGTTTGAAT 
TTTAAATTTA AATCTAGCGA TGGCGAGTTG TTGAAAATTA GCGCGACCGG CGATACGGTT 
ACTTTTACGC CGAAAAAAGG TTCGGTACAG GTTGGCGATG ATGGCAAGGC TTCAATTTCA 
AAAGGTGCAA ATACAACTGA AGGTTTGGTT GAGGCTTCTG AATTGGTTGA AAGCCTGAAC 
AAACTGGGTT GGAAAGTAGG GGTTGAGAAA GTCGGCAGCG GCGAGCTTGA TGGTACATCC 
AAGGAAACTT TAGTGAAGTC GGGCGATAAA GTAACTTTGA AAGCCGGCGA CAATCTGAAG 
GTCAAACAAG AGGGCACAAA CTTCACTTAC GCGCTCAAAG ATGAATTGAC GGGCGTGAAG 
AGCGTGGAGT TTAAAGACAC GGCGAATGGT GCAAACGGTG CAAGCACGAA GATTACCAAA 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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««=3C.XO. C«™C=C ^C^C^ -3«T=0T. C^COOTC.C T»TOCC»C 

^..rr^a tt3cttc=» c=oca™=c o=™. ^..c«tt» — tcoc= 

.0.™ TT«T.CC« T^CCC^T CC«TT«CO 0«.C„CT. ^OT. 

CC««03C= T«C«*CCT «CT=OA«. a.=«T«TC TTO.CC.C» ...C«T^ 
..O™ CT3C.CXT3C .C=X.CTOC. .OTOC«T«= .=0™ 
.™0C C^^^C «.T<==TTTCT =T.™ ^-TC^^T 

^^r^ATTTCC GATAATGGCA AAGTGATTAT TCGCTTGTCT 
GGTTTAGCTA TCGGGGTATC AAGAATTTCC GATAA 

TAAAACAGGC GTTGCAGCAG GTGTTGGTTA CCAGTGG 
GGCACAACCA ATAGTCAAGG TAAAAt.A^v^ 

(2) INFORMATION FOR SEQ ID NO -.15: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 67 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

.e. Asn Lvs lie Phe Asn VaX He T.p Asn Val Val TH. G.n T.. Trp 



162 0. 
1680 
1740 
1800 
1860 
1920 
1980 
2037 



1 ^ 



V.1 V.1 V.1 s« Clu T.. «3 Th. His T.. .ys Oy. s« 
™. v.. vl v.. .eu Ma T« ..u ..u s„ T.. v.1 

35 *° 

, . .la Thr ASP Glu Asn Glu Asp Asp Glu Glu Glu Leu Glu Pro 
Ala Asn Ala Tnr Asp go 
50 

„ phe Lvs Ser Ala Lys Glu Gly 

val Gin Arg Ser Val Leu Arg Trp Ser Phe Lys 

65 ''° 

^1 ^J^^ Ti*» Asn Leu Asn Thr Asp 
Thr Gly Glu Gin Glu Gly Thr Thr Glu val He Asn 
85 

, r.-, c«-r ser Thr He Thr Phe Lys Ala Gly 
ser ser Gly Asn Ala val Gly Ser Ser Thr 

100 

Gin ser Gly Asn Asp Phe Thr Tyr Ser Leu 
ASP Asn Leu Lys He Lys Gin Ser y 
115 

. T Thr ser Val Glu Thr Glu Lys Leu Ser 
Lys Lys Glu Leu Lys Asn Leu Thr Ser 



BNSDOClD-<WO 9630519A1 I > 



wo 96/30519 ^ ~ PCT/US96/04031 

83 

Phe Gly Ala Asn Gly Asn Lys Val Asp lie Thr Ser Asp Ala Asn Gly 
145 150 155 160 

Leu Lys Leu Ala Lys Thr Gly Asn Gly Asn Gly Gin Asn Ser Asn Val 
165 170 175 

His Leu Asn Gly lie Ala Ser Thr Leu Thr Asp Thr Leu Ala Gly Gly 
180 185 190 

Thr Thr Gly His Val Asp Thr Asn lie Asp Ala Val Asn Tyr His Arg 
195 200 205 

Ala Ala Ser Val Gin Asp Val Leu Asn Ser Gly Trp Asn lie Gin Gly 
210 215 220 

Asn Gly Asn Asn Val Asp Phe Val Arg Thr Tyr Asp Thr Val Asp Phe 
225 230 235 240 

Val Asn Gly Ala Asn Ala Asn Val Ser Val Thr Ala Asp Thr Ala His 
245 250 255 

Lys Lys Thr Thr Val Arg Val Asp Val Thr Gly Leu Pro Val Gin Tyr 
260 265 270 

Val Thr Glu Asp Gly Lys Thr Val Val Lys Val Gly Asn Glu Tyr Tyr 
275 280 285 

Lys Ala Lys Asp Asp Gly Ser Ala Asp Met Asn Gin Lys Val Glu Asn 
290 295 300 

Gly Glu Leu Ala Lys Thr Lys Val Lys Leu Val Ser Ala Ser Gly Thr 
305 310 315 320 

Asn Pro Val Lys lie Ser Asn Val Ala Asp Gly Thr Glu Asp Thr Asp 
325 330 335 

Ala Val Ser Phe Lys Gin Leu Lys Ala Leu Gin Asp Lys Gin Val Thr 
340 345 350 

Leu Ser Thr Ser Asn Ala Tyr Ala Asn Gly Gly Thr Asp Asn Asp Gly 
355 360 365 

Gly Lys Ala Thr Gin Thr Leu Ser Asn Gly Leu Asn Phe Lys Phe Lys 
370 375 380 

Ser Ser Asp Gly Glu Leu Leu Lys lie Ser Ala Thr Gly Asp Thr Val 
385 390 395 400 

Thr Phe Thr Pro Lys Lys Gly Ser Val Gin Val Gly Asp Asp Gly Lys 
405 410 415 

Ala Ser lie Ser Lys Gly Ala Asn Thr Thr Glu Gly Leu Val Glu Ala 
420 425 430 

Ser Glu Leu Val Glu Ser Leu Asn Lys Leu Gly Trp Lys Val Gly Val 
435 440 445 
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r V.1 Glv ser Gly Glu Leu Asp Gly Thr Ser Lys Glu Thr Leu 

Glu Lys Val Gly ser oxy 

4 D ^ 

val Zs ser oly Asp .y. V.. Th, .eu .ys .la =ly «u .ys 

4 "7 0 

"al X.VS ^"^ ^"-^ "* °« 

.H, Oly val .ys val Clu ..a .y. "a «n Oly - 

500 

T^ Thr Lvs ASP Gly Leu Thr He Thr Leu Ala 
Gly Ala ser Thr Lys He Thr Lys Asp g y 

515 

^„ .ly .la .s„ Cly .1. val T.. V- 

«a .sp Cly ne ser «a =ly .ys Al. »! .ys «n V.l .la 

545 

THr ser Thr Asp Ala He Asn Gly Ser Gin 
Ala Gly Glu He Ser Ala Thr Ser Tnr p 
565 ^ 

^„ Ty. .13 val .la .ys oly val T.. ..u .la =ly Oln val .sn 
.sn =iu Ty .y» val .a. .ys val .ly .y- - 

... .la Z .la .1. .la sa. =1. =1„ .1- - ..o 

610 

■n riv Ser Ser Tyr Gin Gly Gin Asn 

Gly Lys ser Met Val Ser He Ala Gly Ser se y 

625 "° 

, ^ 71 ^r, tie ser Asp Asn Gly Lys Val He 
Gly Leu Ala He Gly Val Ser Arg He Ser Asp 

645 

^1 Thr Thr Asn ser Gin Gly Lys Thr Gly Val Ala 
He Arg Leu Ser Gly Thr Thr Asn ^ 



660 

Ala Gly val Gly Tyr Gin Trp 



675 



C) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : un)cnown 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CCGTGCTTGC CCAACACGCT T 



21 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCTGCCACCT TGCACAACAA C 21 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTTTCAATGC CAGAAAGTAG G 21 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CTTCAACCGT TGCGGACAAC A 21 
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CLAIMS 

We claim: 

1. A recombinant HoemopWus adhesion protein. 

2. A recombinant Haemophilus adhesion protein according to claim 1 which has 
5 a sequence homologous to that shown in Figure 2. 

3 A recombinant HaemopMus adhesion protein according to claim 1 which has 
a sequence homologous to the amino acid sequence shown in Figure 3. 

4. A recombinant Haemophilus adhesion protein according to claim 1 which has 
the sequence shown in Figure 2. 

10 5. A recombinant Haemophilus adhesion protein according to claim 1 which has 

the amino acid sequence shown in Figure 3. 



6. A 



recombinant nucleic acid encoding an Haemophilus adhesion protein. 



7. The nucleic acid of claim 6 comprising DN A having a sequence homologous to 
that shown in Figure 1 . 
,5 8. Thenudeicacidofclaim6comprisingDNAhavingasequencehomologoustc 

that shown in Figure 3. 

9. The nucleic acid of claim 6 comprising DNA capable of hybridiring to that shown 

in Figure 1- 

lOThenucleicacidofclaimbcompr^ing DNA capableofhybndtzingto that sho.. 

20 in Figure 3. 
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11. The nucleic acid of claim 6 comprising DN A having the sequence shown in 
Figure 1. 

12. The nucleic acid of claim 6 comprising DNA having the sequence shown in 
Figure 3. 

5 13. An expression vector comprising transcriptional and translational regulatorj' 

nucleic acid operably linked to nucleic acid encoding an Haemophilus adhesion 
protein. 

14. A host cell transformed with an expression vector comprising a nucleic acid 
encoding an Haemophilus adhesion protein. 

10 1 5. A method of producing an Haemophilus adhesion protein comprising: 

a) culturing a host cell transformed with an expressing vector comprising 
a nucleic acid encoding an Haemophilus adhesion protein: and 

b) expressing said nucleic acid to produce an //ae/nop/»/7«5adhesionproteia 

16. Avaccinecomprisingapharmaceuticallyacceptablecarrierandan//aemop;»i7iff 
15 adhesion protein for prophylactic or therapeutic use in generating an immune 

response. 

1 7. A vaccine according to claim 16 wherein said Haemophilus adhesion protein 
has a sequence homologous to that shown in Figure 2. 

1 8. A vaccine according to claim 16 wherein said Haemophilus adhesion protein 
20 has a sequence homologous to the amino acid sequence shown in Figure 3. 

19. Amonoclonalantibodycapableofbindingtoan//oemo/7/»7M5adhesionprotein. 
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20. A method of treating or preventing Haemophilus influenzae infection comprising 
administering the vaccine of claim 16. 

21 . A method of treating or preventing a Haemophilus influenzae infection according 
to claim 20 wherein said H. influenzae infection is caused by a non-typable H 

5 influenzae. 
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ATGAACAAAA 


TTTTTAACGT 


TATTTGGAAT 


GTTGTGACTC 


AAACTT6GGT 


T6TCGTATCT 


60 


GAACTCACTC 


GCACCCACAC 


CAAATGCGCC 


TCCGCCACC6 


TGGCGGTTGC 


CGTATTGGCA 


120 


ACCCTGTTGT 


CCGCAACGGT 


TGAGGCGAAC 


AACAATACTC 


CTGTTACGAA 


TAAGTTGAAG 


180 


GCTTATGGCG 


ATGC6AATTT 


TAATTTCACT 


AATAATTCGA 


TA6CA6AT6C 


AGAAAAACAA 


240 


GTTCAAGAGG 


CTTATAAAGG 


TTTATTAAAT 


CTAAATGAAA 


AAAATGCGAG 


TGATAAACTG 


300 


TTGGTQGAGG 


ACAATACTGC 


GGCGACCGTA 


GGCAATTTGC 


GTAAATT6GG 


CTGGGTATTG 


360 


TCTAGCAAAA 


ACGGCACAAG 


GAACGAGAAA 


AGCCAACAAG 


TCAAACATGC 


6GATGAAGTG 


420 


TTGTTTGAAG 


GCAAAGGCG6 


TGTGCA6GTT 


ACTTCCACCT 


CTGAAAACGG 


CAAACACACC 


480 


ATTACCTTTG 


CTTTAGCGAA 


AGACCTTGGT 


GTGAAAACTG 


CGACTGTGAG 


TGATACCTTA 


540 


ACGATTGGCG 


GTGGTGCTGC 


TGCAGGTGCT 


ACAACAACAC 


CGAAAGTGAA 


TGTAACTAGT 


600 


ACAACTGATG 


GCTTGAAGTT 


CGCTAAAGAT 


6CTGCGGGTG 


CTAATGGCGA 


TACTACGGTT 


660 


CACTTGAATG 


GTATTGGTTC 


AACCTTGACA 


GACACGCTTG 


T6GGTTCTCC 


TGCTACTCAT 


720 


ATTGACGGAG 


GAGATCAAAG 


TACGCATTAC 


ACTCGTGCAG 


CAA6TATCAA 


GGATGTCTTG 


780 


AATGCGGGTT 


GGAATATCAA 


6G6TGTTAAA 


GCTGGCTCAA 


CAACTGGTCA 


ATCAGAAAAT 


840 


GTCGATTTTG 


TTCATACTTA 


CGATACTGTT 


GAGTTCTTGA 


GTGCGGATAC 


AGAGACCACG 


900 


ACTGTTACTG 


TAGATA6CAA 


AGAAAACGGT 


AAGAGAACCG 


AAGTTAAAAT 


CGGTGCGAAG 


960 


ACTTCTGTTA 


TCAAAGAAAA 


AGACGGTAAG 


TTATTTACTG 


GAAAAGCTAA 


CAAAGAGACA 


1020 


AATAAAGTTG 


ATGGTGCTAA 


CGCGACTGAA 


GATGCAGACG 


AAGGCAAAGG 


CTTAGTGACT 


1080 


GCGAAAGATG 


TGATTGACGC 


AGTGAATAAG 


ACTGGTTGGA 


GAATTAAAAC 


AACCGATGCT 


1140 


AATGGTCAAA 


ATGGCGACTT 


CGCAACTGTT 


GCATCAGGCA 


CAAAT6TAAC 


CTTTGCTAGT 


1200 


GGTAATGGTA 


CAACTGCGAC 


TGTAACTAAT 


GGCACCGATG 


GTATTACCGT 


TAAGTATGAT 


1260 


GCGAAAGTTG 


GCGACGGCTT 


AAAACTAGAT 


GGCGATAAAA 


TCGCTGCAGA 
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CTTACTGTGA ATGATGGTA& 6AACGCTAAT AATCCGAAAG GTAAAGTGGC TGATGTTGCT 1380 

TCAACTGACG AGAAGAAATT GGTTACAGCA AAAGGTTTAG TAACAGCCTT AAACAGTCTA 1440 

AGCTGGACTA CAACTGCTGC TGAGGCGGAC GGT6GTACGC TTGAT6GAAA TGCAAGT6AG 1500 

CAAGAAGTTA AAGCGGGCGA TAAAGTAACC TTTAAAGCAG GCAAQAACTT AAAAGTGAAA 1560 

CAAGAGGGTG CGAACTTTAC TTATTCACTG CAAGATGCTT TAACAGGCTT AACGAGCATT 1620 

ACTTTAGGTA CAGGAAATAA TGGTGCGAAA ACT6AAATCA ACAAAGAC66 CTTAACCATC 1680 

ACACCAGCAA ATGGTGCGGG TGCAAATAAT GCAAACACCA TCAGCGTAAC CAAAGACGGC 1740 

ATTAGTGCGG GCGGTCAGTC GGTTAAAAAC GTTGTGAGCG GACTGAAGAA ATTTGGTGAT 1800 

GCGAATTTCG ATCCGCTGAC TAGCTCCGCC GACAACTTAA CGAAACAAAA TGACGATGCC I860 

TATAAAGGCT TGACCAATTT GGATGAAAAA GGTACAGACA AGCAAACTCC AGTTGTTGCC 1920 

QACAATACCG CCGCAACCGT GGGCGATTTG CGCGGCTTGG 6CTGGGTCAT TTCTGCGGAC 1980 

AAAACCACAG GCGGCTCAAC GGAATATCAC GATCAAGTTC GGAATGCGAA CGAAGTGAAA 2040 

TTCAAAAGCG GCAACGGTAT CAAT6TTTCC GGTAAAACGG TCAACGGTAG GCGTGAAATT 2100 

ACTTTTGAAT TGGCTAAA6G TGAAGTGGTT AAATCGAATG AATTTACCGT CAAAGAAACC 2160 

AATGGAAAG6 AAACGAGCCT G6TTAAAGTT GGCGATAAAT ATTACAGCAA AGAGGATATT 2220 

GACTTAACAA CAGGTCAGCC TAAATTAAAA GATGGCAATA CAGTTGCTGC GAAATATCAA 2280 

GATAAAGGTG GCAAAGTCGT TTCTGTAACG GATAATACTG AAGCTACCAT AACCAACAAA 2340 

GGTTCTGGCT ATGTAACAGG TAACCAAGTG GCAGATGCGA TTGCGAAATC AGGCTTTGAG 2400 

CTTGGCTTGG CTGATGAAGC TGATGCGAAA CGGGCGTTTG ATGATAAGAC AAAAGCCTTA 2460 

TCTGCTGGTA CAACGGAAAT TGTAAATGCC CACGATAAAG TCCGTTTTGC TAATGGTTTA 2520 

AATACCAAAG TGAGCGCGGC AACGGTGGAA AGCACCGATG CAAACGGCGA TAAAGTGACC 2580 

ACAACCTTTG TGAAAACCGA TGTGGAATTG CCTTTAACGC AAATCTACAA TACCGATGCA 2640 
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AAC6GTAAGA AAATCACTAA AGTTGTCAAA GAT6GGCAAA CTAAATGGTA TGAACTGAAT 2700 

6CTGACGGTA CGGCTGATAT GACCAAAGAA GTTACCCTCG GTAACGTGGA TTCAGACGGC 2760 

AAGAAAGTTG TGAAA6ACAA CGATGGCAAG TGGTATCAC6 CCAAAGCTGA CGGTACTGCG 2820 

GATAAAACCA AAGGCGAAGT GAGCAATGAT AAAGTTTCTA CCGATGAAAA ACACGTT6TC 2880 

AGCCTTGATC CAAATGATCA ATCAAAAGGT AAAGGTGTCG TGATTGACAA TGTGGCTAAT 2940 

GGCGATATTT CTGCCACTTC CACCGATGCG ATTAACGGAA GTCAGTTGTA TGCTGTGGCA 3000 

AAAGGGGTAA CAAACCTTGC TGGACAAGTG AATAATCTTG AGGGCAAAGT GAATAAAGTG 3060 

GGCAAACGTG CAGATGCAGG TACAGCAAGT GCATTAGCGG CTTCACAGTT ACCACAAGCC 3120 

ACTATGCCAG GTAAATCAAT GGTTGCTATT GCGGGAAGTA GTTATCAAGG TCAAAATGGT 3180 

TTAGCTATCG GGGTATCAAG AATTTCCGAT AATGGCAAAG TGATTATTCG CTTGTCAGGC 3240 

ACAACCAATA GTCAAGGTAA AACAGGCGTT GCAGCAGGTG TT6GTTACCA GTGG 3294 
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Met Asn Lys He P^e Asn Val He Trp Asn val Val Thr Gin Thr Trp 

val val val Ser Glu Leu Thr Arg Thr His Thr Lys Cys Ala Ser Ala 

20 25 
^ val Al. val Ala val Lau Ala Thr Leu Leu Ser Ala Thr Val Glu 

Ala ABn Asn ABn Thr Pro Val Thr A.n I.y= Leu Ly. Ala Tyr Gly Aap 

Ala L Phe ABU Phe Thr A.n Abu Ser II. Ala ABP Ala Glu Lya Gin 

V 0 

val Gin Glu Ala Tyr LyB Gly Leu Leu ABn Leu Abu Glu LyB ABn Ala 

85 ^" 
ser ABP Ly. Leu Leu Val Glu Abp Aen Thr Ala Ala Thr val Gly A.n 

100 -^"^ 
L.U Arc Lye Leu Gly Trp Val Leu Ser Ser Lye Aen Gly Thr Arg ABn 

115 ■^^^ 
Glu LyB ser Gin Gin val Lye HIb Ala A.P Glu Val Leu Phe Glu Gly 

Lye Gly Gly Val Gin Val Thr Ser Thr Ser Glu Aen Gly LyB HIb Thr 

III Thr Phe Ala Leu Ala Lye Asp Leu Gly val Lye Thr Ala Thr val 

165 

ser ABP Thr Leu Thr He Gly Gly Gly Ala Ala Ala Gly Ala Thr Thr 
180 

Thr pro LyB val ABn val Thr Ser Thr Thr ABP Gly Leu Ly. Phe Ala 
LyB ABP Ala Gly Ala A.n Gly A.P Thr Thr val Hi. L.u ABn Gly 

He Gly ser Thr Leu Thr Abp Thr Leu Val Gly Ser Pro Ala Thr Hie 

225 230 

r-Tr, Cot- Thr His Tyr Thr Arg Ala Ala Ser He 
He ASP Gly Gly Asp Gin Ser Thr his ryr 

245 

LyB ABP val Leu ABn Ala Oly Trp ABn He Lye Gly Val LyB Ala Gly 

ser Thr Thr Oly Gin ser Glu ABn val ABp Phe val Hi. Thr Tyr Abp 

275 280 
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Tlir Val Glu Phe Leu Ser Ala Asp Thr Glu Tlir Tlir Thr Val Thr Val 
290 295 300 

Asp Ser Lys Glu Asn Gly Lys Arg Thr Glu Val Lys lie Gly Ala Lys 
305 310 315 320 

Thr Ser Val lie Lys Glu Lys Asp Gly Lys Leu Phe Thr Gly Lys Ala 

325 330 335 

Asn Lys Glu Thr Asn Lys Val Asp Gly Ala Asn Ala Thr Glu Asp Ala 
340 345 350 

Asp Glu Gly Lys Gly Leu Val Thr Ala Lys Asp Val lie Asp Ala Val 
355 360 365 

Asn Lys Thr Gly Trp Arg He Lys Thr Thr Asp Ala Asn Gly Gin Asn 
370 375 380 

Gly Asp Phe Ala Thr Val Ala Ser Gly Thr Asn Val Thr Phe Ala Ser 
385 390 395 400 

Gly Asn Gly Thr Thr Ala Thr Val Thr Asn Gly Thr Asp Gly He Thr 
405 410 415 

Val Lys Tyr Asp Ala Lys Val Gly Asp Gly Leu Lys Leu Asp Gly Asp 
420 425 430 

Lys He Ala Ala Asp Thr Thr Ala Leu Thr Val Asn Asp Gly Lys Asn 
435 440 445 

Ala Asn Asn Pro Lys Gly Lys Val Ala Asp Val Ala Ser Thr Asp Glu 
450 455 460 

Lys Lys Leu Val Thr Ala Lys Gly Leu Val Thr Ala Leu Asn Ser Leu 
465 470 475 480 

Ser Trp Thr Thr Thr Ala Ala Glu Ala Asp Gly Gly Thr Leu Asp Gly 

485 490 495 

Asn Ala Ser Glu Gin Glu Val Lys Ala Gly Asp Lys Val Thr Phe Lys 
500 505 510 

Ala Gly Lys Asn Leu Lys Val Lys Gin Glu Gly Ala Asn Phe Thr Tyr 
515 520 525 

Ser Leu Gin Asp Ala Leu Thr Gly Leu Thr Ser He Thr Leu Gly Thr 
530 535 540 

Gly Asn Asn Gly Ala Lys Thr Glu He Asn Lys Asp Gly Leu Thr He 
545 550 555 560 

Thr Pro Ala Asn Gly Ala Gly Ala Asn Asn Ala Asn Thr He Ser Val 

565 570 575 

Thr Lys Asp Gly He Ser Ala Gly Gly Gin Ser Val Lys Asn Val Val 
580 585 590 
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Ser Gly Leu Lys Lys Phe Gly Asp Ala Asn Phe Asp Pro Leu Thr Ser 
595 600 605 

Ser Ala Asp Asn Leu Thr Lys Gin Asn Asp Asp Ala Tyr Lys Gly Leu 
610 615 620 

Thr Asn Leu Asp Glu Lys Gly Thr Asp Lys Gin Thr Pro Val Val Ala 
625 630 635 640 

Asp Asn Thr Ala Ala Thr Val Gly Asp Leu Arg Gly Leu Gly Trp Val 

645 650 655 

lie Ser Ala Asp Lys Thr Thr Gly Gly Ser Thr Glu Tyr His Asp Gin 
660 665 670 

Val Arg Asn Ala Asn Glu Val Lys Phe Lys Ser Gly Asn Gly lie Asn 
675 680 685 

Val Ser Gly Lys Thr Val Asn Gly Arg Arg Glu lie Thr Phe Glu Leu 
690 695 700 

Ala Lys Gly Glu Val Val Lys Ser Asn Glu Phe Thr Val Lys Glu Thr 
705 710 715 720 

Asn Gly Lys Glu Thr Ser Leu Val Lys Val Gly Asp Lys Tyr Tyr Ser 

725 730 735 

Lys Glu Asp lie Asp Leu Thr Thr Gly Gin Pro Lys Leu Lys Asp Gly 
740 745 750 

Asn Thr Val Ala Ala Lys Tyr Gin Asp Lys Gly Gly Lys Val Val Ser 
755 760 765 

Val Thr Asp Asn Thr Glu Ala Thr He Thr Asn Lys Gly Ser Gly Tyr 
770 775 780 

Val Thr Gly Asn Gin Val Ala Asp Ala He Ala Lys Ser Gly Phe Glu 
785 790 795 800 

Leu Gly Leu Ala Asp Glu Ala Asp Ala Lys Arg Ala Phe Asp Asp Lys 

805 810 815 

Thr Lys Ala Leu Ser Ala Gly Thr Thr Glu He Val Asn Ala His Asp 
820 825 830 

Lys Val Arg Phe Ala Asn Gly Leu Asn Thr Lys Val Ser Ala Ala Thr 
835 840 845 

Val Glu Ser Thr Asp Ala Asn Gly Asp Lys Val Thr Thr Thr Phe Val 
850 855 860 

Lys Thr Asp Val Glu Leu Pro Leu Thr Gin He Tyr Asn Thr Asp Ala 
865 870 875 880 

Asn Gly Lys Lys He Thr Lys Val Val Lys Asp Gly Gin Thr Lys Trp 

885 890 895 
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Tyr Glu Leu Asn Ala Asp 61y Thr Ala Asp Met Thr Lys Glu Val Tbr 
900 905 910 

Leu Gly Asn Val Asp Ser Asp Gly Lys Lys Val Val Lys . Asp Asn Asp 
915 920 925 

Gly Lys Trp Tyr His Ala Lys Ala Asp Gly Thr Ala Asp Lys Thr Lys 
930 935 940 

Gly Glu Val Ser Asn Asp Lys Val Ser Thr Asp Glu Lys His Val Val 
945 950 955 960 

Ser Leu Asp Pro Asn Asp Gin Ser Lys Gly Lys Gly Val Val lie Asp 

965 970 975 

Asn Val Ala Asn Gly Asp lie Ser Ala Thr Ser Thr Asp Ala lie Asn 
980 985 990 

Gly Ser Gin Leu Tyr Ala Val Ala Lys Gly Val Thr Asn Leu Ala Gly 
995 1000 1005 

Gin Val Asn Asn Leu Glu Gly Lys Val Asn Lys Val Gly Lys Arg Ala 
1010 1015 1020 

Asp Ala Gly Thr Ala Ser Ala Leu Ala Ala Ser Gin Leu Pro Gin Ala 
1025 1030 1035 1040 

Thr Met Pro Gly Lys Ser Met Val Ala lie Ala Gly Ser Ser Tyr Gin 

1045 1050 1055 

Gly Gin Asn Gly Leu Ala Xle Gly Val Ser Arg lie Ser Asp Asn Gly 
1060 1065 1070 

Lys Val lie lie Arg Leu Ser Gly Thr Thr Asn Ser Gin Gly Lys Thr 
1075 1080 1085 

Gly Val Ala Ala Gly Val Gly Tyr Gin Trp 
1090 1095 
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1 TTTNTrWTCTTATTTTTri'T^^ 
61 AAATATCACTTTTTTATTCTCCAAATATAaAATAGAATACGCACGATTT^C^^ 

121 GTATATTTAicATTAArPT^ATTAAATATiAGGTA^ 

181 GTTATTTGGAATGTTAT^^^^ 

241 ACCAAACGckcTCCGCAAaCGTGGAGACaG^^^^^ 

301 GTT«OGCGjATGCTACCekTG«OXTO«a«TTA^^^ 

361 GTGJTGAGCTTCaTTCCa^TAAAOAXOGi^CGO^ 

4.1 L^OGjGxjTXT.TTTCaic«T«.Goi^^^ 

4ei GCCGOCa.ci.CCTO««ic«AC»«« 
541 »CCT*CTCGCTGX««*GACCTCACAG^^^ 

601 TTTGGCGC«*CGGCGXTAiAGTTGATATi»CCAGTGXTicAA^^^ 

FGANGDKVDITbUAw 

661 ««CAGGTiACaG»AATGiTC»TTTG«iGaTTTGOATTC»A^ 

KTGNG NVHLNGLD&i*'^^ 

721 ACGAATACAaGTGTGTTAAGTTCATCAMixTTACACOT^^ 

TNTGVLSSSSFTfWA^ 

761 GCTGCAACTGTTAAAGATGiTTTA«TGci«3TJ»^ 

841 G0AGOTAATGTTGAGAGToiTGAT«AGTiKCGC«ATAAT«^^ 

GGNVBSVDLV5Ai« 

,01 G3CGAT«^CACGCTTGiTGTTGTATTiACA^^^ 
,61 GTGLTTCACACCGAAAAicKTGTTATiA^^ 

1031 LlGLT«CGACACA«TAAAGTTAciAGT«CACGGCGACT»^^^^ 

KENNDTNKVTSWi^ 

1061 GGT^TGGCTTAGTCACTG^AAAAGCTGTiATTGATG^^^^ 

nG.-3A 
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180 

240 

300 
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600 
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1080 
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1141 GTTAAAACAACTACT6CTAATGGTCAAAATGGCGACTTCGCAACTGTT6CGTCAGGCACA 1200 
VKTTTANGQNGDPATVASGT 

• ••••• 

1201 AATGTAACCTTTGAAAGTGGCGATGGTACAACAGCGTCAGTAACTAAAGATACTAACG6C 1260 
NVTFESGDGTTASVTKDTNG 

12 6 1 AATGGCATCACTGTTAAGTACGACGCGAAAGTTGGCGACGGCTTGAAATTTGATAGCGAT 1320 
NGITVKYDAKVGDGLKFDSD 

1321 AAAAAAATCGTTGCAGATACGACCGCACTTACTGTGACAGGTGGTAAGGTAGCTGAAATT 1380 
KKIVADTTALTVTGGKVAEI 

1381 GCTAAAGAAGATGACAAGAAAAAACTTGTTAATGCAGGCGATTTGGTAACAGCTTTAGGT 1440 
AKEDDKKKLVNAGDLVTALG 

1441 AATCTAAGTTGGAAAGCAAAAGCTGAGGCT6ATACTGATGGTGCGCTTGAGGGGATTTCA 1500 
NLSWKAK AEADTDGALEGIS 

1501 AAAGACCJ^GAAGTCAAAGCAGGCGAAACGGTAACCTTTAAAGCGGGCAAGAACTTAAAA 1560 
KDQEVKAGETVTPKAGKNLK 

.«•••• 

1561 GTGAAACAGGATGGTGCGAACTTTACTTATTCACTGCAAGATGCTTTAACGGGTTTAACG 162 0 
VKQDGANFTYSLQDALTGLT 

• , ♦ • • • 

1621 AGCATTACTTTAGGTGGTACAACTAATGGCGGAAATGATGCGAAAACCGTCATCAACAAA 1680 
S ITLG6TTNGGNDAKT VINK 

• • • • • • 

1681 GACGGTTTAACCATCACGCCAGCAGGTAATGGCGGTACGACAGGTACAAACACCATCAGC 1740 
DGLTITPAGNGGTTGTNTIS 

1741 GTAACCAAAGATG6CATTAAAGCAGGTAATAAAGCTATTACTAATGTTGCGAGTGGTTTA 1800 
VTKDGIKAGNKAITNVASGL 

1801 AGAGCTTATGACGATGCGAATTTTGATGTTTTAAATAACTCTGCAACTGATTTAAATAGA 1860 
RAYDDANFDVLNNSATDLNR 

1861 CACGTTGAAGATGCTTATAAAGGTTTATTAAATCTAAATGAAAAAAATGC AAATAAACAA 1920 
HVEDAYKGLLNLNEKNANKQ 

1921 CCGTTGGTGACTGACAGCACGGCGGCGACTGTAGGCGATTTACGTAAATTGGGTTGGGTA 1980 
PLVTDSTAATVGDLRKLGWV 

• • • • • 

1981 GTATCAACCAAAAACGGTACGAAAGAAGAAAGCAATCAAGTTAAACAAGCTGATGAAGTC 2040 
VSTKNGTKEESNQVKQADEV 

2041 CTCTTTACCGGAGCCGGTGCTGCTACGGTTACTTCCAAATCTGAAAACGGTAAACATACG 2100 
LFTGAGAATVTSKSENGKHT 

2101 ATTACCGTTAGTGTGGCTGAAACTAAAGCGGATTGCGGTCTTGAAAAAGATGGCGATACT 2160 
ITVSVAETKADCGLEKDGDT 

2161 ATTAAGCTCAAAGTGGATAATCAAAACACTGATAATGTTTTAACTGTTGGTAATAATG6T 2220 
IKLKVDNQNTDNVLTVGNKG 

2221 ACTGCTGTCACTAAAGGTGGCTTTGAAACTGTTAAAACTGGAGCGACTGATGCAGATCGC 2280 
TAVTKGGFETVKTGATDADR 
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2281 GGTAAAGTAACTGTAAAAGATGCTACTGCT^T^^^^^ "40 

2341 «»OATGTT^CCGC«™OTGCCGC0AC™ "»» 

KDVATAINSAAi' 

3.01 >^cc^'^^-^''^-f-^rf'V°^T'rt'rf'T 

2461 L*CC^AACCTTTJ^GiA«.T«*^icT^^ 
2521 lcTTTTa>^jTGOCG^CCJTG^G^^CTGCG«^^^ 

2581 LcGCGGG^lATACACCTxiAGGTGGCAci^CTOCG^CGCC^ 

iGGNTPTGGTTAiiTxv 

2641 ACGGCT«TG3TTrG«TTiTG=*«A5^^ 
2701 TATTTGAMGOTATTGCGAiAMTTTAAciGAOC^^ 

2761 GTTGATTTAAATOTOGATGioACGAAAAAiKX«TGCAa»AGTATT^^^ "20 
VDLHVDATKKSHAASIBBVi. 

2621 CGCGCAOGTTOG«TATTC^«5T«T«3™t«TC^ 
2681 LLA^CTTTACCGAT^^GCACA^jiACAACAA^^^ 

2,41 LGGCAAA^TGCTGACGiTAAAATC«5iGCG«AACTi^ 3000 
DGKGADVKieAriis5 

3001 |CCAAACTG™GGC«Aa.CCJG«i^^^ 

3061 LLGGaU«GACACCGaCACA^CTTAOTTACT«»^CTG^^^^ 3120 
DDGKDTGTGLVTAK1 

3121 AATAAAAGC^TJJ^AGGGTAACCG^^^ 

3181 LLtGCGGGT^CGCTOAAACCG^^^^^ 

3241 Lo^CclcAGCGtcCG^^ 

3301 «TGTTGGTGACGGCTrG«GATTGG™C|^ "60 

3361 CTTACTGTaIcAGGTGOT^GG^^^^^^^ 
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• • • • • • 

3421 AA6AAACTTGTTAAT6CA6A6G6TTTA6C6ACTGCTTTAAACAACCTAAGCTG6AC66CA 3480 
KKLVNAEGLATALNNLSWTA 

• ••••• 

3481 AAAGCCGATAAATAT6CAGATGGCGAGTCAGA66GCGAAACCGACCAAGAA6TCAAA6CA 3540 
KADKYADGESE6ETDQEVKA 

• • • • • « 

3541 6GCGACAAAGTAACCTTTAAAGCAGGCAAGAACTTAAAAGT6AAACAGTCTGAAAAA6AC 3600 
GDRVTFKA6KNLKVRQSEKD 

• • • • • • 

3601 TTTACTTATTCACT6CAAGACACTTTAACAGGCTTAACGAGCATTACTTTAGGTGGTACA 3660 
FTYSLQDTLTGLTSITLGGT 

• • • • ♦ , 

3661 GCTAATGGCAGAAATGATACGGGAACCGTCATCAACAAAGACGGCTTAACCATCACGCTG 3720 
ANGRNDT6TVINKDGLTITL 

• • • • « » 

3721 GCAAATGGTGCTGCGGCAGGCACAGATGCGTCTAACGGAAACACCATCAGTGTAACCAAA 3780 
ANGAAAGTDASNGNTISVTK 

• • • • • , 

3781 6ACGGCATTAGTGC6GGTAATAAAGAAATTACCAATGTTAAGAGTGCTTTAAAAACCTAT 3840 
DGISAGNKEITNVKSALKTY 

• • • • • • 

3841 AAAGATACTCAAAACACTGCAGATGAAACACAAGATAAAGAGTTCCACGCCGCCGTTAAA 3900 
KDTQNTADETQDKEFHAAVK 

• • • • • • 

3901 AACGCAAATGAAGTTGAGTTCGTGGGTAAAAACGGTGCAACCGTGTCTGCAAAAACTGAT 3960 
NANEV EFVGKNGATVSAKTD 

3961 AACAACGGAAAACATACTGTAACGATTGATGTTGCAGAAGCCAAAGTTGGTGATGGTCTT 4020 
NNGKHTVTIDVAEAKVGDGL 

• ••••• 

4021 6AAAAA6ATACTGACGGCAA6ATTAAACTCAAAGTA6ATAATACAGATGGGAATAATCTA 4 080 
EKDTDGKIKLKVDNTDGNNL 

• • • • » « 

4081 TTAACCGTTGATGCAACAAAAGGTGCATCCGTTGCCAAGGGCGAGTTTAATGCCGTAACA 414 0 
LTVDATKGASVAKGEFNAVT 

• • • • • * 

4141 AC AGATGCAACTAC AGCCCAAGGCACAAATGCCAATGAGCGCGGTAAAGTGGTTGTCAAG 42 00 
TDATTAQGTNANERGKVVVK 

• • • • « , 

4201 GGTTCAAATGGTGCAACTGCTACCGAAACTGACAAGAAAAAAGTGGC AACTGTTGGCGAC 4260 
GSNGATATETDKKKVATVGD 

• • • • • • 

4261 GTTGCTAAAGCGATTAACGACGCAGCAACTTTCGTGAAAGTGGAAAATGACGACAGTGCT 4320 
VAKAINDAATFVKVENDDSA 

• • • • • « 

4321 ACGATTGATGATAGCCCAACAGATGATGGCGCAAATGATGCTCTCAAAGCAGGCGACACC 4380 
TIDDSPTDDGANDALKAGDT 

• • • • « • 

43 81 TTGACCTTAAAAGCGGGTAAAAACTTAAAAGTTAAACGTGATGGTAAAAATATTACTTTT 4440 
LTLKAGKNLKVKRDGKNITF 

• • • • • • 

4441 GCCCTTGCGAACGACCTTAGTGTAAAAAGCGCAACCGTTAGCGATAAATTATCGCTTGGT 4500 
ALANDLSVKSATVSDKLSLG 

• • • • • « 

4501 ACAAACGGCAATAAAGTCAATATCACAAGCGACACCAAAGGCTTGAACTTCGCTAAAGAT 4560 
TNGNKVNITSDTKGLNFAKD 
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4561 AGTAAGACAGGCGATGATGCTAATATTCaiCTTAAATGGCATTGCTTCyACTTTAA^ 4620 
SKTGDDANIHLNGIASTLTD 

4621 ACATTGTTAAATAGT6GTGC6ACAACCAATTTAGGTGGTAATGGTATTACTGATAACGAG 4680 
TLLNSGATTNLGGN6ITDNE 

4681 AAAAAACGC6CG6CGAGCGTTAAAGATGTCTTGAATGCG6GTTGGAATGTTCGTGGTGTT 4740 
KKRAASVKDVLNAGWNVRGV 

4741 AAACCGGCATCTGCAAATAATCAAGTGGAGAATATC6ACTTTGTAGCAACCTACGACACA 48 
KPASANNQVENIDPVATYDT. 

• • ■ ■ 

4801 GTGGACTTTGTTAGTGGAGATAAAGACACCACGAGTGTAACTGTTGAAAGTAAAGATAAT 4860 
VDFVSGDKDTTSVTVESKDN 

4861 66CAAGAGAACC6AAGTTAAAATCGGTGCGAAGACTTCTGTTATCAAAGACCACAAC6GC 4920 
GKRTEVKI6AKTSVIKDHNG 

4921 AAACTGTTTACAGGCAAAGAGCT6AAGGATGCTAACAATAATGGCGTAACT6TTACCGAA 4980 
KLFT GKELKDANNNGVTVTE 

4981 ACCGACGGCAAAGACGAGGGTAATGGTTTAGTGACTGCAAAA6CTGTGATTGATGCC6TG 5040 
TDGKDEGN6LVTAKAVIDAV 

. , . • • 

5041 AATAAGGCTGGTTG6AGA6TTAAAACAACAGGTGCTAATGGTCAGAATGATGACTTCGCA 5100 
NKAGWRVKTTGANGQNDDFA 

5101 ACTGTTGCGTCAGGCACAAAT6TAACCTTTGCTGATGGTAATGGCACAACTGCCGAAGTA 5160 
TVASGTNVTFADGNGTTAEV 

5161 ACTAAAGCAAACGACGGTAGTATTACTGTTAAATACAATGTTAAAGTG6CTGATGGCTTA 5220 
TKANDGSITVKYNVKVADGL 

5221 AAACTAGACGGCGATAAAATCGTTGCAGACACGACCGTACTTACTGTGGCAGATGGTAAA 5280 
KLDGDKIVADTTVLTVADGK 

5281 GTTACAGCTCCGAATAATGGCGAT6GTAA6AAATTTGTTGATGCAAGTGGTTTAGCGGAT 534 0 
VTAPNNGD6KKFVDASGLAD 

• • • 

5341 GCGTTAAATAAATTAAGCTGGACGGCAACT6CTGGTAAAGAAGGCACTGGTGAAGTTGAT 5400 
ALNKLSWTATAGKE6TGEVD 

5401 CCTGCAAATTCAGCAGGGCAAGAAGTCAAAGCGG6CGACAAAGTAACCTTTAAAGCCGGC 5460 
PANSAGQEVKAGDKVTFKAG 

5461 GACAACCTgAaAATCAAAC^GCGGCAA^GACTTTACCTACTCGCTGAAAAAAGAGCTG 5520 
DNLKIKQSGKDFTYSLKKEL 

5521 aAAGACCTGACCAGCGTAGAGTTCAAAGACGCAAACGGCGGTACAGGCAGTGAAAGCACC 5580 
KDLTSVEFKDANGGTGSEST 

5581 aAGATTACCAAAGACGGCTTGACCATTACGCCGGCAAACGGTGCGGGTGCGGCAGGTGCA 5640 
KITKDGLTITPANGAGAAGA 

5641 AACACTGCAWVCACCATTAGCGTAACCAAAGATGGCATTAGCGCGGGTAATAAAGCAGTT 5700 

ntantisvtkdgisagnkav 
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5701 ACAAACGTTGTGAGCGGACTGAAGAAATTTGGTGATGGTCATACGTTGGCAAATGGCACT 5760 
TNVVSGLKKPGDGHTLANGT 

5761 GTTGCTGATTTTGAAAAGCATTATGACyUlTGCCTATAAAGACTTGACCAATTTGGATGAA 5820 
VADFEKHYDNAYKDLTNLDE 

5821 AAAGGCGCGGATAATAATCCGACTGTTGCCGACAATACCGCTGCAACCGTGQGCGATTTG 5880 
KGADNNPTVADNTAATVGDL 

5881 CGCGGCTTGGGCTGGGTCATTTCTGCGGACAAAACCACAGGCGAACCCAATCAGGAATAC 5940 
RGLGWVISADKTTGEPNQEY 

5941 AACGCGCAAGTGCGTAACGCCAATGAAGTGAAATTCAAGAGCGGCAACGGTATCAATGTT 6000 
NAQVRNANEVKFKSGNGINV 

6001 TCCGGTAAAACATTGAACGGTACGCGCGTGATTACCTTTGAATTGGCTAAAGGCGAAGTG 6060 
SGKTLNGTRVITFELAKGEV 

6061 GTTAAATCGAATGAATTTACCGTTAAGAATGCCGATGGTTCGGAAACGAACTTGGTTAAA 6120 
VKSNEFTVKNADGSETNLVK 

6121 GTTGGCGATATGTATTACAGCAAAGAGGATATTGACCCGGCAACCAGTAAACCGATGACA 6180 
VGDMYYSKEDIDPATSKPMT 

6181 GGTAAAACTGAAAAATATAAGGTTGAAAACGGCAAAGTCGTTTCTGCTAACGGCAGCAAG 6240 
GKTEKYKVENGKVVSANGSK 
* 

6241 ACCGAAGTTACCCTAACCAACAAAGGTTCCGGCTATGTAACAGGTAACCAAGTGGCTGAT 6300 
TEVTLTNKGSGYVTGNQVAD 
• 

6301 GCGATTGCGAAATCAGGCTTTGAGCTTGGTTTGGCTGAT6CGGCAGAAGCTGAAAAAGCC 6360 
AIAKSGFELGLADAAEAEKA 
• 

6361 TTTGCAGAAAGCGCAAAAGACAAGCAATTGTCTAAAGATAAAGCGGAAACTGTAAATGCC 6420 
FAESAKDKQLSKDKAETVNA 

*••••. 

6421 CACGATAAAGTCCGTTTTGCTAATGGTTTAAATACCAAAGTGAGCGCGGCAACGGTGGAA 6480 
HDKVRFANGLNTKVSAATVE 

• ♦ • • • , 

6481 AGCACTGATGCAAACGGCGATAAAGTGACCACAACCTTTGTGAAAACCGATGTGGAATTG 6540 
STDANGDKVTTTFVKTDVEL 

• • • • • « 

6541 CCTTTAACGCAAATCTACAATACCGATGCAAACGGTAATAAGATCGTTAAAAAAGCTGAC 6600 
PLTQIYNTDANGNKIVKKAD 

6601 GGAAAATGGTATGAACTGAATGCTGATGGTACGGCGAGTAACAAAGAAGTGACACTTGGT 6660 
GKWYELNADGTASNKEVTLG 

• • • • • • 

6661 AACGTGGATGCAAACGGTAAGAAAGTTGTGAAAGTAACCGAAAATGGTGCGGATAAGTGG 6720 
NVDANGKKVVKVTENGADKW 

• • • • • , 

6721 TATTACACCAATGCTGACGGTGCTGCGGATAAAACCAAAGGCGAAGTGAGCAATGATAAA 6780 
YYTNADGAADKTKGEVSNDK 

• • • • • a 

6781 GTTTCTACCGATGAAAAACACGTTGTCCGCCTTGATCCGAACAATCAATCGAACGGCAAA 6840 
VSTDEKHVVRLDPNNQSNGK 
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544 T. . .GNNGAKTEINiaX3LTITPANGAGANNANTISVTKIX5ISAGGQSVKN 590 

• • • • • 

541 VASGLRAYDDANFDVIiNNSATDLNRHVEDAYKGLLNLNEKNANKQ . PLVT 589 

I . I I I : : : I I I I I . I * • I I * : I « : : : I I I I I I I I : I I . * : I I 1:1. 
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1 *TOAAC«*A TTTTT«COT TATTTOG^AT OTTGTG&CTC AAACTTGOGT 
51 TOTCOTATCT GAACTCACTC OCACCCACXC CAAATOCOCC TCCGCCACCG 
101 TGGCAGTTOC CGTATTGOCA ACCCTGTTGT CCGCAACGOT TCAGGCGAAT 
151 GCTXCCGATG AAAACGAAGA TGATGAAGAA GAGTTAGAAC CCGTACAACG 
201 CTCTGTTTTA AOGTGGAGCT TCAAATCCGC TAAGOAAGGC ACTGGAGAAC 
251 AAGAGGGAAC AACAGAGGTA ATAAATTTGA AOACAGATTC ATCAGGAAAT 
301 GCAGTAGGAA GCAGCACAAT CACCTTCAAA GCCGGCGACA ACCTGAAAAT 
351 CAAACAAAGC GGCAATGACT TCACCTACTC GCTGAAAAAA GAGCTGAAAA 
401 ACCTGACCAG TGTTOAAACT GAAAAATTAT CGTTTGGCGC AAACGGOAAT 
451 AAAGTTGATA TTACCAGTGA TGCAAATGGO TTGAAATTGG CGAAAACAGG 
501 TAACGGAAAT GGTCAAAACA GTAATGTTCA CTTAAACGGT ATTGCTTCGA 
551 CTTTGACCGA TACGCTTK5CC GGTOGCACAA CAOGACACGT TGACACCAAC 
601 ATTGATGCGO TTAATTATCA TCGCGOTGCA AGCGTACAAO ATGTGTTAAA 
651 CAGCGGTTGG AATATCCAAG GCAATGGAAA CAATGTCGAT TTTGTCCGTA 
,01 CTTACGACAC CGTGGACTTT GTCAATGGCG CGAATGCCAA TGTGAGCGTT 
,51 ACGGCTCATA CGGCTCACAA AAAGACAACT GTCCGTGTGG ATGTAACAGG 
SOI CTTGCCGGTT CAATATGTTA CGGAAGACGG CAAAACCGTT GTGAAAGTGG 
851 GCAATGAGTA TTACAAAGCC AAAGATGACG GTTCGGCGGA TATGAATCAA 
,01 AAAGTCGAAA ACGGCGAGCT GGCGAAAACC AAAGTGAAAT TGGTATCGGC 
,51 AAGCGGTACA AATCCGGTGA AAATTAGOAA TGTTGCAGAC GOCACGGAAG 

FIG.-14A 
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1 rantiFHvivm wtqt«vws eltrthtkcx satvavavi* tij,satvqah 

51 ATDENEDDBE BI.EFVQRSVI, RWSFKSAKEG TaBQEGTTBV IBI^SSGH 

101 AVOSSTITFK AGDHLKIKQS GHDFTVSLKK ELFm-TSVET EKLSFGANGH 

151 KVI.ITSBAI.G LKLAKTGHGH GOKS»VHI^G lASTLTDTIA GGTTGHVBIN 

^01 IDAVNVHRAA SVQim.HSGW KIQGNG«HVD FVRTYDTVDF VNGAHAKVSV 

251 TAIWABKKTT VBVI,VTGI.PV QYVTEDGKTV VKVGHEYYKA KDDGSABMNQ 

301 KVENGELAKT KVKLVSASGT SPVFISMVAD GTBDTDAVSF KQLKAMDKQ 

351 VTLSTSSAYA HGGTDBTGGK ATQTLSNGI* FKFKSSDGEL I.K1SATGDTV 

401 TFTPKKGSVQ VGDDGKASIS KGAKTTEGLV BASELVESLN KI.GWKVGVEK 

«1 VGSGELDGTS KETLVKSGDK VTLKAGDHLK VKQBGTHFTY AIJ»E1.TGVK 
501 SVEFKDTANG A»GASTKITK DGLTTTUU.0 AHGATVTOAD KIKVASDGIS 
551 AGNKAVKNVA AGEISATSTD AIHGSQI.YAV MCGVTSIJVGQ VNHLEGKVKK 
601 VGKRADAGTA SALAASQLPQ ATMPGKSMVS lAGSSVQGQN GLAIGVSRIS 
651 DNGKVI1III.S GTTHSQGKTG VAAOVGYQW 

FIG.-15 
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1 MNKIFNVIWNWTQTWVWSELTRTHTKCASATVAVAVIATLLSATVEAN 50 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I i I I I I I I I i I : I I 
1 MNKIFNVIWNVVTQTWVVVSELTRTHTKCASATVAVAVLATIjIiSATVQAN 50 

• • • • • 

51 NNTPVTNKLKAYGDANFNFTNNSIADAEKQVQEAYKGLLNLNEKNASDKL 100 

51 ATD£NEDDEEELEPVQRSVIiRWSFKSAKEG« 80 

• • • • • 
101 LVEDNTAATV6NLRKL6WVI.SSKN6TRNEKSQQVKHADEVLFE6KGGVQV 150 

. I : : ill . . : . I : I : 

81 TGEQE6TTEVINL NTDSS6NAV6SST1TFKAGDNLKI 117 

• • • • • 

151 TSTSENGKHTITFALAKDLGVKTATVSDTLTIGGGAAAGATTTPKVNVTS 200 
. • : : : : I : . I I : t I . . . : . I . : I : • : • I I : : I I 

118 KQSGND FTYSLKKELKNIiTSVETEKIiSFGANGN KVDITS 156 

« • • ■ • 

201 TTDGLKFAKDAAGANGDTTVHLNGIGSTLTDTLVGSPATHXDGGDQSTHY 250 
. . : II I : I I . : . I . . . : . . II I I I i : I I I II II . I : . . . I ; I . . : . . : I 

157 DANGLKLAKTGNGKGQNSNVHLNGXASTLTDTLAGGTTGHVDTNIDAVNY 206 

• « . • • « 

2 51 TRAASIiOJVLNAGWNIKGVKAGSTTGQSENVDFVHTYDTVEFLSADTETT 300 
I I I I I I I I . I I I I . K : : I I I I I : I I M I : I : . : 

2 07 HRAASVQDVLNSGWNIQ GNGNNVDFVRTYDTVDFVNGANANV 248 

• « • • • 
301 TVTVDSKENGKRTEVKIGAKTSVIKEKOGKLFTGKANKETNKVDGANATE 350 

.11.1 I . : . : . ...III.. : I . . I : : . I . 

249 SVTADTAHKKTTVRVDVTGLPVQYVTEDGKTWKVGNEYYKAKDDGSADM 298 

• « • • • 

3 51 DADEGKGIiVTAKDVIDAVNKTGWRIKTTDANGQNGDFATVA SG 393 

: . . : . I : . • . I : I I I • : 

2 99 NQKVENGEIAKTKVKLVSASGTNPVKISNVADGTEDTDAVSFKQLKALQD 348 

• • • • « 

3 94 TNVTFASGNGTTATVTNG TDGITVKYDAKVGDGLKLDGDKI 434 

• . I I : . . : I : I : . . : I : . • I : • . • I : II:.: 

349 KQVTLSTSNAYANGGTDNDGGKATQTLSNGLNFKFKSSDGELIiKISA. • . 395 
« • • « • 

4 35 AADTTAIiTVNDG . • KNANNPKGKVADVASTDEKKLVTAKGLVTAIiNSIiSW 482 

. : I U . : U . . I . : : : . I : . : . . . I . I . I I I . I . : II . • M . I : I 
39 6 TGDTVTFTPKKGSVQVGDDGKASISKGANTTE.GIiVEASELVESLNKIiGW 444 

• « • • • 

4 83 TTTAAEAIX^TLIXJNASEQEVKAGDKVTPKAGKNLKVKQEGANFTYSLQD 532 

: : I . I I I . . . I I U I I I I I : I I I . I I I M I II . I II I . I . I 

44 5 KVGVEKVGSGELIXSTSKETLVKSGDKVTLIU^GDNLKVKQEGTNFTYAIjKD 494 



533 ALTGLTSITL . • . GTGNNGAKTEINKDGLTIT PANGAGANNANTISV 57 6 

.III:.!:.: : . I . I II . I . 1 . I II II II . I I M . . . : I : . I . I 
495 ELTGVKSVEFKDTANGANGASTKITKDGLTITLANGANGATVTDADKIKV 544 

577 TKDGISAGGQSVKNWSGLKKFGDANFDPLTSSADNLTKQNDDAYKGLTN 626 
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^' ^ S^us\'ui°ey relau to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



□ 



bSe'ui'V dependent clwms «id «e not drafted in ,ccord»ce with the lecond »d third sentence* of Rule 6.4(»). 



Box II Ohservauoiis where unity of invention is lacking (Continuation of item 2 of first sheet) 



Thii InternnionjU Seirching Authority found multiple inventions in this intemaUonal applicauon. as follows: 



1. rn AS .11 required additionil sesrch fees were timely paid by the appUcanl. this iniemationaJ search report cover* aU 

searchable claims. 

2. □ AS all searchable claims could be searches without effort justifying an additional fee. this Authority did not invite payment 

of any additional fee. 

3 n AS only some of the required additional «»rch fees were timely p«d by the appUcant, this international search report 
' ' ' covers only those claims for which fees were paid. speaficaUy daims Noi.: 



4 n No required addiUonal search fees were timely paid by the appUcanu ConsequenUy this internauonal search report is 
' restrTci^ to the invention first mentioned in the claims; it is covered by daims Nos„ 



Remark Protest 



j [ The additional search fees were accompanied by the appUcant s protest. 

I I No protest accompanied the payment of additional search fees. 
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